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A biomedical engineering Ph.D. student at the University of California, Davis, Yongheng 
Wang had a problem: He wanted to identify the top 10 targets from thousands of genes 
affected by a drug he developed. The process to mine and extract information would 
normally take a huge investment of time and effort, switching back and forth between 
different programming languages and software packages. Wang was delighted to 
learn of a solution: an innovative data visualization system called “Dr. Tom" from BGI 
Genomics, a leading global provider of genomic sequencing and proteomics services, 
headquartered in Shenzhen, China. 


Developed by a team of expert scientists and bioinformaticians at BGI Genomics, Dr. 
Tom is a powerful tool for analysis, visualization, and interpretation of many types 
of data, including RNA sequencing (RNA-Seq), long noncoding RNA-Seq, microRNA- 
Seq, whole-genome bisulfite sequencing, single-cell RNA-Seq, and proteomics data. 
Having an easy learning curve, Dr. Tom is also versatile, and most importantly, 
allows researchers without backgrounds in bioinformatics to quickly, efficiently, 
and reliably create meaningful visualizations. “The intuitive interface allows us to 
select genes of interest and rank differentially expressed genes based on their fold 
changes,” says Wang. "Dr. Tom covers all the practical functionalities. | think a high 
school student could do it.” 


Designed by scientists for scientists 

Dr. Tom is a project fueled by the passion of Chong Tang, deputy director of research 
and development at BGI Tech. A biochemist and molecular biologist with expertise in 
software engineering, Tang's extensive experience helped him uncover the specific 
pain point experienced by his research colleagues around the world: Given the 
tsunami of ‘omics data, there is a critical need for a flexible, easy, and turnkey data 
analysis program. "| designed it in such a way that scientists will like it," he says. 

Dr. Tom is designed to be flexible and simple for users with limited computing 
knowledge, notes Margot Maurer, research associate in the Wolfson Centre for 
Age-Related Diseases at King's College London. Maurer uses Dr. Tom to analyze the 
transcriptomics differences in sensory ganglia to help her investigate RNA and age- 
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Innovative data visualization for gene-related research 


related diseases. “| don’t need to go through all the calculations,” she explains. “Dr. 


Tom is the bridge. With just a couple of clicks, | can see the graphs, which are easy 
to read.” No programming skills are needed for analyzing these genetic sequences, 
she adds. I really like this very visual Venn diagram comparing the different gene 
expressions in samples.” 

This visualization feature is designed to be interactive so it is easier to bounce 
ideas around the team, Tang says. He spent 5 years designing, testing, and fine- 
tuning Dr. Tom to serve his community, and now Dr. Tom's web-based interface is 
inspiring innovative research and reducing researchers’ headaches. 

“The BGI culture is a culture of scientists and for scientists.” Tang says. The 
organization considers it so vital to advance scientific discovery that it is gifting this 
software to the world. 


Making a mark 

Relying on cutting-edge sequencing and bioinformatics technology, BGI 
Genomics has provided scientists and researchers in 100 countries and regions 
with integrated solutions across a broad range of applications spanning basic 
life sciences research, clinical research in human health, and agriculture and 
biodiversity preservation and sustainability. 

Since its launch in 2018, Dr. Tom has enchanted more than 15,000 users from over 
20 countries and regions. “More than 60 published papers in the field of disease 
treatment, developmental regulation, immunity, and environmental adaptation have 
shown Dr. Tom to be a valuable and important tool in addition to any institution's 
own internal data curation and analysis efforts,” says Tang. 

BGI Genomics has partnered with academic institutions to offer workshops on 
leveraging this system effectively, and will be expanding these opportunities to 
more countries and regions in the future. 
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EDITORIAL 


Save the Supreme Court and democracy 


he US Supreme Court has been busy. It recently 
overturned a nearly 50-year-old precedent pro- 
tecting abortion rights, upheld the right to carry 

guns outside the home, and hamstrung the En- 
vironmental Protection Agency’s ability to regu- 

late emissions—all while signaling an aversion 

to contemporary empirical evidence and instead 
favoring “history and tradition.” Although the majority 
of Americans disagree with many of these decisions, the 
court has only just begun to reshape the country. When 
it resumes in October, the court will be poised to outlaw 
affirmative action, undercut federal regulations regard- 
ing clean water, and possibly allow state legislatures to 
restrict voting rights without oversight by state courts. 
What explains the court’s shift to an ideological extreme, 
and what can be done about it? 
The court’s eye-popping move 
to the conservative right is con- 
firmed by research that compares 
its decisions to public opinion. 
Decades-long surveys reveal 
that the court’s rulings were in 
step with the opinions of most 
Americans through 2020. How- 
ever, around 2021, its views on 
important issues (including abor- 
tion and religion) swerved to the 
right of those held by a majority 
of the public—and are now more 
aligned with the views of Repub- 
lican voters. This sharp turn coincided with the 2020 
death of Ruth Bader Ginsburg, a liberal-leaning justice. 
The shift can be best understood by looking at the 
identity of the court’s ideological “median voter.’ In 
considering the nine Supreme Court justices from 
most liberal to most conservative, the justice who sits 
in the median fifth position is the one whose vote will 
create a five-person majority in many cases. This gives 
the median enormous power over decisions. In 2018, 
the median justice was Anthony Kennedy, a moderate 
conservative but also an occasional cross-over voter on 
issues such as abortion and capital punishment. Upon 
retiring that year, he was replaced by staunch conser- 
vative Brett Kavanaugh, shifting the median rightward 
to Chief Justice John Roberts. As the median, Roberts 
would have kept the court more or less in line with 
the American ideological mainstream. However, when 
Ginsburg was replaced by conservative Amy Coney 
Barrett, the median moved further to the right—some- 
where around Kavanaugh or Neil Gorsuch. Although 


“What explains the 
court’s shift to an 


ideological extreme, 
and what can 
be done about it?” 


Roberts could be considered center-right, new medi- 
ans Kavanaugh and Gorsuch are firm conservatives. 
By some estimates, Gorsuch is more conservative than 
around 85% of Americans. 

Such a shift matters because public perception of 
the court’s ideology is critical. If people think the court 
is ideologically opposed to them, they will be more 
likely to think that it is acting purely politically, and 
they will be more likely to support proposals that curb 
the court’s powers. Indeed, a method of estimating 
judicial ideology based on justices’ voting—known as 
Martin-Quinn scores—suggests that the justices have 
separated into two ideological blocs defined by po- 
litical party, implying not just a strongly consolidated 
conservative majority but also less ideological “cross- 
over” voting. If the majority of 
Americans continue to strongly 
disagree with the court’s deci- 
sions, then the public may well 
turn against it, which may not 
only lead to greater political con- 
flict but also undermine the rule 
of law. 

Reforming the court to pre- 
vent extreme ideological move- 
ments may be difficult, but not 
impossible. For example, 18-year 
term limits for justices would 
regularize appointments—elimi- 
nating gamesmanship around 
vacancies and reducing incentives for justices to stra- 
tegically time retirements. This would help prevent ex- 
treme partisan imbalance and thus keep the court closer 
to the ideological mainstream. Term limits enjoy wide 
bipartisan support and would put the US in line with 
other democratic peer nations, all of which have term 
or age limits for their high courts. Additional promising 
proposals by scholars to help reduce ideological imbal- 
ance include reconfiguring how the US selects justices 
and expanding the size of the court. Others—such as 
stripping the court’s jurisdiction—would address the ar- 
gument that the court wields too much power. 

As gridlock and polarization continue to undercut 
the efficacy of elected branches of government, the Su- 
preme Court’s salience in matters of public importance 
will only rise. And the court’s conservative supermajor- 
ity (6-to-3)—one closely aligned with Republican Party 
policy interests—has many years left. The discussion of 
court reform will not go away soon. 

—-Maya Sen 
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464 We [worry it] will hinder Lilly's and Indiana's ability to 
attract diverse scientific talent from around the world. 99 


Pharmaceutical giant Eli Lilly and Co. on the state’s passage of what the 
corporation called “one of the most restrictive anti-abortion laws in the United States.” 
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PLANETARY SCIENCE 


South Korea sends orbiter to Moon to search for ice 


outh Korea’s first Moon probe was lofted into 
space from Cape Canaveral in Florida on 4 August 
by a SpaceX rocket. The $200 million Korea 
Pathfinder Lunar Orbiter, also called Danuri— 
“enjoy the Moon” in Korean—will study the Moon 
from a polar orbit for at least a year. One of the 
probe’s five instruments will capture polarized light to 
measure the grain sizes of lunar dust, an indicator of 


“weathering” by the solar wind and hence of the age 
of features such as lava flows and impact craters. In 
another first, a highly sensitive camera on Danuri will 
peek into the depths of the Moon’s permanently shad- 
owed craters to inventory the water ice known to lurk 
there. Other instruments on the satellite will find suit- 
able locations for a lander, planned for the early 2030s, 
as South Korea’s next step into deep-space exploration. 


U.S. boosts monkeypox response 


PUBLIC HEALTH | President Joe Biden’s 
administration last week designated the 
monkeypox outbreak a national public 
health emergency, allowing U.S. health 
officials easier access to funds and proce- 
dural flexibility as they respond to rising 
cases (more than 8900 as of 8 August). 
Earlier in the week, the White House 
appointed Robert Fenton, a senior official 
at the Federal Emergency Management 
Agency, as national monkeypox response 
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coordinator. Demetre Daskalakis, a physi- 
cian who directs the Division of HIV 
Prevention at the U.S. Centers for Disease 
Control and Prevention, will serve as 
deputy coordinator. Daskalakis has experi- 
ence working with the LGBTQ community; 
97.5% of monkeypox cases with available 
data on sexual behavior have been in men 
who have sex with men, according to a 

3 August report from the World Health 
Organization. As Science went to press, the 
United States had the world’s largest num- 
ber of confirmed monkeypox cases. 


Few get prompt hep C treatment 


INFECTIOUS DISEASE | Less than one-third 
of people infected with the hepatitis C 
virus in the United States who have medi- 
cal insurance receive lifesaving treatment 
for the disease within 1 year of their diag- 
nosis, the U.S. Centers for Disease Control 
and Prevention (CDC) reported this week. 
The liver-damaging virus caused 14,000 
deaths in the United States in 2019. But 
an 8- to 12-week course of pills that were 
hailed as a medical breakthrough when 
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they came to market nearly a decade ago 
can cure most infections. CDC analyzed the 
treatment records of 47,687 insured people 
from January 2019 through October 2020 
and blamed the low uptake on eligibility 
restrictions and what are known as pre- 
authorization requirements that insurance 
plans often impose. The treatment rate was 
lowest, 23%, among people on Medicaid, 
the federal program that covers low- 
income Americans. 


Small reactor design wins OK 


NUCLEAR POWER | NuScale Power, a 
startup based in Portland, Oregon, came 
closer to bringing a small, modular nuclear 
reactor to market when the U.S. Nuclear 
Regulatory Commission last week certified 
its design. NuScale says it would be safer, 
more versatile, and more economical than 
larger conventional reactors. The company 
has a deal with an electricity supplier, the 
Utah Associated Municipal Power Systems 
(UAMPS), to build a power plant compris- 
ing six of the factory-built reactors at Idaho 
National Laboratory. NuScale isn’t done 
with NRC reviews, however. The regulator 
certified the new reactor to produce 

50 megawatts of electrical power—about 
5% of the capacity of a typical power 
reactor—but the UAMPS plan now calls for 
running each of the six at a more cost- 
efficient 77 megawatts. That 54% “uprate” 
in power will require further NRC review 
before the Idaho plant can be licensed for 
operations. Other countries are also devel- 
oping small reactor designs. 


Primate center director stays on 


SCIENTIFIC INTEGRITY | Critics have 
panned a decision by the Southwest 
National Primate Research Center (SNPRC) 
to retain its director with full duties despite 
a finding by the U.S. Office of Research 
Integrity (ORI) that he engaged in research 
misconduct. As Retraction Watch first 
reported, ORI found that Deepak Kaushal 
fabricated data on testing tuberculosis 
drugs in monkeys in a study published in 
2020 and in two grant applications. He and 
colleagues produced the data while he was 
working at the Tulane National Primate 
Research Center; he became SNPRC’s direc- 
tor in 2019. Kaushal, who did not respond 
to several interview requests from Science, 
has not escaped all consequences: His 
settlement agreement with ORI requires 

a committee of senior faculty members to 
supervise his research for 1 year. SNPRC, 
based in San Antonio, said the misconduct 
finding does not affect his leadership of 
the center, however. The Texas Biomedical 
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Microscopic fluid-filled channels 
isolate HIV-infected cells 

and trap their genetic contents in 
droplets (shown) for analysis. 


Probing HIV’s cellular fortresses opens cure avenues 


research team has developed a way to assess the gene activity of single cells that 
harbor latent HIV genes—a technique that could aid the search for a cure. People 
living with HIV who take existing antiretrovirals invariably retain infected cells that 
dodge the drugs and natural immune responses. Even though scientists could 
identify these rare reservoir cells, technical constraints prevented them from evalu- 
ating the cells’ gene activity. The new method, revealed at the 24th International AIDS 
Conference last week, hinges on “microfluidic” devices (above) that allow investigators to 
retrieve genetic material from the infected cells for sequencing. The team found that the 
reservoir cells had unique patterns of gene activity, turning on genes that protect them 
from immune attack and self-destruction. Targeting these genes could, in theory, reduce, 


if not eliminate, the HIV reservoirs. 


Research Institute, which oversees SNPRC, 
noted that Kaushal’s team retracted the 
original study and published a corrected 
version, whose conclusions did not change. 
But some outside scientists say the mis- 
conduct finding could feed public doubts 
about animal research generally and should 
prevent Kaushal from leading SNPRC. 

The center received $10.3 million in federal 
funding in 2021 and houses about 

2500 marmosets, baboons, and macaques. 


Long Covid gets U.S. office 


covib-19 | President Joe Biden’s admin- 
istration last week published a plan for 
research on Long Covid, the long-lasting 
sequelae of acute SARS-CoV-2 infection, 
which the U.S. Centers for Disease Control 
and Prevention recently estimated affects 
up to one in five U.S. adults previously 
infected with the pandemic corona- 

virus. The blueprint creates an Office of 
Long COVID Research and Practice at 
the Department of Health and Human 
Services. It will be headed by Rachel 
Levine, the department’s assistant secre- 
tary for health, who has been overseeing 
the administration’s Long Covid response. 


The new research will explore diagnostics 
and treatments and build on existing stud- 
ies by the National Institutes of Health and 
the Department of Veterans Affairs to seek 
genes associated with Long Covid. 


Radar mapper declared dead 


REMOTE SENSING | Ina blow to scientists 
who monitor tiny movements of Earth’s 
surface caused by earthquakes, volcanoes, 
and glaciers, the European Space Agency 
said last week that Sentinel-1B, a radar 
mapping satellite, cannot be revived after 
a power supply malfunction caused it to 
go dark in December 2021. Investigators 
say a defective capacitor probably caused 
the fault. The loss of the satellite, launched 
in 2016, leaves the agency with a twin, 
Sentinel-1A, which was lofted into orbit 
two years earlier and is operating past its 
designed lifetime of 7 years. The duo had 
surveyed the globe, measuring surface 
changes as small as a few millimeters every 
6 days as it orbited the planet. Until the 
agency launches Sentinel-1C in 2023, the 
intervals between observations will drop 
to 12 days, hampering data collection and 
mapping efforts. 
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INFECTIOUS DISEASES 


Scientists scramble to set up 
monkeypox vaccine trials 


Logistical and ethical challenges are complicating the 


design of efficacy studies 


By Kai Kupferschmidt 


hen monkeypox suddenly started 

spreading globally in May, the 

world was fortunate in one re- 

spect: a vaccine was available. 

MVA, originally developed by 

Bavarian Nordic as a smallpox 
vaccine, was already licensed for monkey- 
pox in Canada and the United States. EU 
regulators have since followed suit. By now 
clinics have delivered thousands of doses 
to people in high-risk groups. 

There’s little doubt the vaccine can help, 
but that’s about all that’s certain. Exactly 
how well MVA protects against monkeypox 
and for how long is not known. Nor is it clear 
how much protection is lost by giving just 
a single dose rather than the recommended 
two doses, as some countries are doing to 
stretch supply, or how much protection a 
vaccine given after exposure can offer. 

But the ethical and logistical complexi- 
ties of the monkeypox crisis, which is over- 
whelmingly affecting men who have sex 
with men (MSM), are making these ques- 
tions hard to answer. Placebo-controlled 
clinical trials are fraught because MVA is 
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already licensed and people are clamoring 
to get it. And vaccine clinics are often set up 
at short notice as doses become available, 
making it harder to organize a trial and en- 
roll subjects. Researchers are responding 
with a plethora of inventive trial designs. 

The first evidence that smallpox vac- 
cines also protect against monkeypox came 
from a study in the 1980s in the Democratic 
Republic of the Congo (then called Zaire), 
where the virus occasionally jumps from 
animals to people, who then infect others 
in their household. A study among patients’ 
contacts suggested smallpox vaccination 
was 86% effective at preventing monkey- 
pox as well. But the study looked at a small 
number of cases, the virus was genetically 
quite different than the one now spreading, 
and the smallpox vaccine was an older one 
with more side effects; MVA was developed 
as a safer alternative. 

MVA was licensed for monkeypox based 
on data from animal experiments and the 
immune response it triggers in humans. 
But its efficacy has barely been tested in 
people, and not at all for preventing sex- 
ual transmission, which results in “very 
significant mucosal exposure, which is 


Empty monkeypox vaccine vials and syringes at a 
vaccination point in Valhalla, New York, on 28 July. 


not the same thing as just brushing up 
against somebody,” says Anne Rimoin, an 
epidemiologist at the University of Califor- 
nia, Los Angeles. 

So far, there’s scant data on how well the 
vaccine is working in the current outbreak. 
Among 276 individuals who received a shot 
at a Paris hospital as post-exposure pro- 
phylaxis (PEP) after reporting a high-risk 
contact, 12 developed a monkeypox infec- 
tion, French scientists reported in a recent 
preprint. But there was no control group, 
making it impossible to tell how many 
people would have developed monkey- 
pox if no one had been vaccinated. And 
people eager to be vaccinated may have 
lied about having had a high-risk contact. 
“That makes results from these studies 
on PEP really hard to evaluate,” says im- 
munologist Leif Erik Sander of the Charité 
clinic in Berlin, who’s setting up a vaccine 
study in Germany. 

A randomized trial—in which one group 
receives the vaccine and the other does 
not—would avoid such problems. Without 
a randomized study, “you can end up in 
this evidence limbo and find that if you had 
just done the trial, you would have been 
in a much better situation,” says virologist 
Natalie Dean of the University of Florida. 

Giving a control group a placebo instead 
of a presumably effective vaccine is ethi- 
cally dicey, many researchers say. But Uni- 
versity of Oxford epidemiologist Richard 
Peto sees another way. Because demand 
for the vaccine is so much higher than 
supply, “Why not randomize the order in 
which people in the highest risk group are 
called in?” Peto asks. So far, however, no 
one seems to have taken up that idea. 

Sander considered a randomized design 
but decided against it. “There was a lot of 
pushback,” he says. Instead, he has started 
a so-called cohort study in which he hopes 
to enroll 5000 vaccinated and 10,000 un- 
vaccinated people at risk of monkeypox 
and follow them for 12 months. (Over 
time, some of the unvaccinated people 
will receive the shot so the groups may be- 
come more similar in size.) So far, about 
800 people have been enrolled. 

The groups may differ in ways other than 
their vaccine status—people with lots of 
sexual contacts may try harder to get vacci- 
nated, for example—but there is still an ele- 
ment of randomization, Sander says: Many 
doctors are using lottery-type procedures to 
decide who gets the vaccine first. 

A cohort study in France is taking an- 
other approach. There, MSM already 
enrolled in a study of sexually transmit- 
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ted diseases—and deemed at high risk 
of monkeypox—will get MVA in the next 
2 months. Jade Ghosn of Bichat Hospital, 
who runs the study, hopes to have all par- 
ticipants vaccinated by the end of Septem- 
ber and plans to compare infection rates 
before and after vaccination. 

Another option is a “test-negative” de- 
sign, in which researchers look at people 
who seek testing for monkeypox and com- 
pare the percentages of people who were 
vaccinated among those who test positive 
and negative. This is “probably the stron- 
gest nonrandomized approach to measur- 
ing vaccine efficacy,’ says Michael Marks, 
an epidemiologist at the London School of 
Hygiene & Tropical Medicine who is plan- 
ning a vaccine trial soon with colleagues 
in Spain. 

The test-negative setup requires good 
linkage between vaccination and testing 
data. “If we can solve that issue we may 
use such a design in our study,’ Marks says. 
The Canadian province of Ontario is go- 
ing ahead with a similar design, says Jeff 
Kwong of the University of Toronto. The 
drawback is that testing and vaccination 
data alone can’t answer many other ques- 
tions, such as how immunity develops over 
time or whether disease severity is differ- 
ent among the vaccinated and the unvac- 
cinated; that requires additional studies. 

The U.S. National Institute of Allergy 
and Infectious Diseases (NIAID) does plan 
a randomized trial, aimed at finding out 
if the vaccine supply can be stretched by 
giving people much smaller doses. Partici- 
pants will either get two full doses or two 
one-fifth doses 4 weeks apart; a third arm 
may be added to test one-tenth of the nor- 
mal dose, says NIAID’s John Beigel, who 
is involved in designing the study. The 
lower doses will be injected into the skin, 
which is known to cause a more vigorous 
immune response than the standard sub- 
cuteanous shot. But the study, expected to 
start in September, will only test whether 
fractional doses trigger a similar reaction 
as the full dose; it won’t measure vaccine 
efficacy directly. 

One strategy not tested in the trial, even 
though it is being used, is giving just one 
full dose. Available data suggest that regi- 
men is inferior to two full doses, Beigel says: 
“We don’t think it’s scientifically supported.” 

With so many unanswered questions it’s 
hard to provide good vaccine information 
to those at risk, says Will Nutland, a U.K 
community organizer who runs an organi- 
zation for MSM sexual health. That should 
not deter people from seeking the shots, he 
says: “I think most people understand ... 
that it is better to receive some level of pro- 
tection than no protection at all.” 
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Harassment researchers decry 
proposed reporting rule 


USS. Title IX law update requiring mandatory reporting 
of sexual misconduct would cause harm, they say 


By Katie Langin 


hen Allison Cipriano read the U.S. 

Department of Education’s recently 

proposed amendments to Title IX, 

the federal statute prohibiting sex- 

ual discrimination at educational 

institutions that receive federal 
funding, she was “quickly disappointed.” 
The 700-page document includes many rule 
changes she wanted to see, including protec- 
tions for sexual minorities. But Cipriano, a 
Ph.D. student in psychology at the University 
of Nebraska, Lincoln, who studies sexual mis- 
conduct investigations in academia, and oth- 
ers are dismayed by one set of changes. They 
would compel most university employees 
to be “mandatory reporters’—required to 
notify their institution’s Title IX office of 
any alleged sexual misconduct involving 
students they become aware of, regardless 
of whether the student wants them to. 

Researchers who study such policies have 
spoken out against them, saying they re- 
traumatize targets of sexual misconduct and 
reduce their ability to seek the support they 
need. Mandatory reporting, which propo- 
nents argue helps universities unearth and 
address misconduct, is already common at 
US. universities; a 2018 study found 88% of 
institutions required most or all employees 
to serve as mandatory reporters. But the 
government’s proposed rule changes—open 
for comment until 12 September—would 
make the requirement universal, blocking 
efforts to try other approaches. 

“I was expecting all the changes to be 
good, so I’m caught off guard ... by what 
I perceive to be a potential—if it doesn’t 
get fixed—disaster,’ says Jennifer Freyd, a 
trauma researcher and professor emerit 
at the University of Oregon. She fears 
the new mandatory reporting rules will 
be particularly harmful for graduate stu- 
dents, who are often highly dependent on 
faculty members and are more vulnerable 
than undergraduates to “career-killing” re- 
taliation. “If ... a grad student can speak 
privately with a supportive faculty member 
in their department about how to navigate 
the bind they are in, it can save a career,” 
she says. 


Past regulations have required universi- 
ties to designate some employees as manda- 
tory reporters but didn’t define who those 
employees should be. The new regulations, 
in contrast, would require any university 
employee who “has the authority to in- 
stitute corrective measures” or the “re- 
sponsibility for administrative leadership, 
teaching, or advising” to serve as a manda- 
tory reporter. That definition encompasses 
most faculty members, says Lilia Cortina, 
a professor at the University of Michigan, 
Ann Arbor, who studies gender and co- 
authored the National Academies of Sci- 
ences, Engineering, and Medicine’s 2018 
sexual harassment report. 

“In the report, we talked about how 
problematic that is,’ Cortina says of broad 
mandatory reporting policies. “[There’s] 


ah: 


Faculty members could be required to report 
information shared with them in confidence. 


research showing that when you take con- 
trol away from victims, that’s actually as- 
sociated with an increase in psychological 
distress.” The requirements “essentially 
amount to nonconsensual reporting [of] 
their traumatic or humiliating or otherwise 
extremely distressing experience.” 

In a statement to Science, a Department 
of Education spokesperson wrote that the 
updates were developed “with the aim of 
ensuring full protection under Title IX for 
students, employees, and others and to 
end all forms of sex discrimination.” They 
declined to comment about why universal 
mandatory reporting is necessary, but the 
proposed regulations note that students 
“may be less capable of self-advocacy” and 
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may share Title IX violations with those who 
fall under the proposed mandatory reporter 
definition “with the expectation that doing 
so would obligate the recipient to act.” 

But targets of sexual harassment may 
go to trusted faculty members for reasons 
other than to trigger a formal investigation, 
as Cipriano found when she interviewed 
graduate students about their experiences 
with mandatory reporting. For instance, one 
student opened up to her adviser because 
she wanted to let him know she was seek- 
ing mental health services, Cipriano and her 
colleagues report in a preprint posted last 
month. “I had to do it on work time, so I felt 
like he deserved to know why I wasn’t at my 
desk,” the student said. 

Most of the students in Cipriano’s study 
weren’t aware their institution even had a 
mandatory reporting policy. “I was under the 
impression that talking to my adviser was a 
safe communication,” one said. Another told 
Cipriano she freaked out after learning a 
report would be filed. “When you’re in the 
midst of trauma ... youre not going to be 
thinking about the ins and outs of university 
policy,’ Cipriano says. “Students are not go- 
ing to see their professors as a de facto arm 
of the Title IX office.” 

At a listening session the Department of 
Education held last year to help guide the de- 
velopment of the new regulations, Cipriano 
shared research she and others have done on 
mandatory reporting—which made it all the 
more frustrating when she read the resulting 
amendments. “It really does feel like a bit of 
a slap in the face,” she says. 

Others, including Brett Sokolow, a lawyer 
who serves as president of the Association 
of Title IX Administrators, see the issue dif- 
ferently. “What most research on mandated 
reporting misses is that mandated reporting 
largely does not result in ‘forced prosecu- 
tions’ but instead results in outreach, shar- 
ing of resources, and discussion of options. 
That’s not really taking agency away from 
survivors,” he says. 

The proposed regulations will block efforts 
to test other approaches. The University of 
Oregon, for instance, has a “mandatory sup- 
porter” policy, which requires most employ- 
ees—including most faculty members—to 
tell students how they could report sexual 
misconduct and then let the student decide 
how to move forward. (Those in higher ad- 
ministrative positions, such as deans and de- 
partment heads, are mandatory reporters.) 

“We need some room for experimenta- 
tion,” says Freyd, who helped develop the 
University of Oregon policy. “I don’t think 
we're at a point yet where we have enough 
information for the federal government to 
dictate to institutions everything about how 
they should handle Title IX issues.” 
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Sunspots wax and wane with the 11-year solar cycle, as shown in an illustration of minimum (left) 
and maximum activity, but they mostly vanished for decades soon after telescopes emerged. 


ASTROPHYSICS 


Star’s midlife crisis illuminates 
our Sun’s history—and future 


Long magnetic lull mimics Maunder Minimum, 
when sunspots largely disappeared 400 years ago 


By Zack Savitsky 


oon after European astronomers de- 

veloped the first telescopes at the start 

of the 17th century, they observed dark 

spots speckling the Sun’s surface. They 

also handed their modern successors a 

mystery. From about 1645 to 1715, the 
spots, now known to be indicators of solar 
activity, all but disappeared. Gathering sun- 
spot counts and other historical observations, 
astronomer John Eddy concluded nearly 
50 years ago that the Sun had essentially 
taken a 70-year nap, which he called the 
Maunder Minimum after an astronomer cou- 
ple who had previously studied it (Science, 
18 June 1976, p. 1189). 

Now, it appears the Sun is not the only star 
that takes long naps. By building a decades- 
long record of observations of a few dozen 
stars at specific wavelengths that trace stellar 
activity, a team of astronomers has identified 
another star going through its own Maunder 
Minimum period. “I am more convinced this 
is a Maunder Minimum star than anything 
else I’ve seen,” says Jennifer van Saders, an as- 
tronomer at the University of Hawaii, Manoa, 
who was not involved in the discovery. 

The finding, reported in a preprint last 
month on arXiv, could help explain what trig- 
gered the Sun’s strange behavior 400 years 
ago and suggests more such episodes are 
likely. “This is the way to study the past and 


future of the Sun,” van Saders says. She adds 
the discovery supports a theory she and col- 
leagues have advanced: that such events are 
an occasional symptom of a critical transition 
in the magnetic field of Sun-like stars about 
halfway through their lifetime—a midlife cri- 
sis of sorts. Some astronomers speculate that 
the Sun’s transition helped favor the emer- 
gence of life on Earth, and that searching for 
stars in a similar stage could help identify 
other solar systems conducive to complex life. 

Scientists have known for decades that our 
Sun’s activity surges and ebbs on a roughly 
11-year cycle, which corresponds to how often 
its magnetic poles flip their orientation. Dur- 
ing a solar maximum, sunspots proliferate, 
marking weak points in the magnetic field, 
where plasma from the Sun’s atmosphere can 
lash out in violent loops. Astronomers have 
spotted young Sun-like stars with similar 
cycles, and older ones that have totally stable 
activity. But no one had spotted a cycling star 
suddenly turning flat. 

In 2018, as part of undergraduate research 
at Pennsylvania State University, University 
Park, Anna Baum set out to combine observa- 
tions of the telltale wavelengths from 59 stars 
taken by the Mount Wilson Observatory and 
the W. M. Keck Observatory to produce a 50- 
year chronology of star evolution. During a 
7-year gap in data while Keck was upgrading 
a detector, one star appeared to show a dras- 
tic shift. Its activity went from cycling over a 
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17-year period to being virtually flat, and it’s 
stayed that way for the past 18 years. 

Baum thought at first she’d made an error; 
perhaps the observatories were even looking 
at two different stars. But earlier this year, 
her colleagues came across additional obser- 
vations that filled in the data gap, capturing 
the star’s emissions as it switched from ac- 
tive to quiet. The recovered data set “hit the 
jackpot,” says Jacob Luhn, an astronomer at 
the University of California, Irvine, and lead 
author on the preprint. 

The discovery reinforces one popular the- 
ory about why these extended quiescent peri- 
ods happen. Stars spin more slowly with age 
because their solar winds act as “magnetic 
brakes,” like a child sticking out their arms 
while revolving in a chair. In 2016, van Saders 
and her colleague Travis Metcalfe of the 
White Dwarf Research Corporation noticed 
that at some point, stars stop hitting the 
brakes and their velocity stabilizes—a shift, 
they proposed, that stems from a change 
in the stars’ magnetic field. Then, last year, 
Dibyendu Nandi and colleagues at the Center 
of Excellence in Space Sciences India pinned 
down the idea with computer simulations 
that linked the stabilizing of the spin rate to a 
weakening magnetic field. During this transi- 
tion, as the star heads toward a “lazy” state in 
which its activity is flat rather than cycling, 
random perturbations in its magnetic field 
can result in temporary cycle shutoffs like the 
Maunder Minimum, Nandi says. 

The theory predicts that this transition 
state will emerge in middle-aged stars—just 
like our Sun and this newly identified nap- 
ping star. “Everything about this discovery 
has actually corroborated what we’ve been 
talking about for the last 5 years,’ Metcalfe 
says. “We definitely knew about stars that 
were not cycling, but we didn’t know how 
they got there—this is like the missing link in 
that evolutionary picture.” 

Our Sun’s magnetic transition probably be- 
gan around the same time life on Earth first 
crawled out of the sea, and that may be no 
coincidence, Metcalfe suggests. The incom- 
ing particles and radiation from active stars 
damage DNA and promote mutations, speed- 
ing evolution. They “may be part of the neces- 
sary ingredients to get life started,” he says. 
But at some point, energetic space weather 
poses a threat to complex life—“like a giant 
cosmic reset button that’s always going off,’ 
he adds. 

Stars undergoing a transition from cycling 
to stable could provide the ideal balance 
of spark and protection to nurture life. “If 
were looking for technological civilizations,” 
Metcalfe says, “maybe the best place to look 
is around stars that are in the second half of 
[their] lifetimes’—in other words, just enter- 
ing a midlife crisis. & 


SCIENCE science.org 


RESEARCH ETHICS 


Star marine ecologist guilty of 
misconduct, university says 


University of Delaware finding vindicates whistleblowers 


By Martin Enserink 


long-running controversy in marine 
biology took a new twist last week 
when the University of Delaware (UD) 
found one of its star scientists guilty 
of research misconduct. The univer- 
sity accepted an investigative panel’s 
conclusion that marine ecologist Danielle 
Dixson committed fabrication and _falsifi- 
cation in work on fish behavior and coral 
reefs. UD is seeking the retraction of three of 
Dixson’s papers and “has notified the appro- 
priate federal agencies,’ a spokesperson says. 

Among the papers is a study about coral 
reef recovery that Dixson published in 
Science in 2014 and for which the journal is- 
sued an Editorial Expression of Concern in 
February. Science—whose News and Editorial 
teams operate independently of each other— 
retracted that paper on 9 August. 

The investigative panel’s draft report, 
which Science’s News team has seen in heav- 
ily redacted form, paints a damning picture 
of Dixson’s scientific work. “The Committee 
was repeatedly struck by a serial pattern of 
sloppiness, poor recordkeeping, copying and 
pasting within spreadsheets, errors within 
many papers under investigation, and devia- 
tion from established animal ethics proto- 
cols,’ wrote the panel of three UD researchers. 

Dixson did not respond to requests for 
comment. She “adamantly denies any and 
all allegations of wrongdoing, and will vigor- 
ously appeal any finding of research miscon- 
duct,” her lawyer, Kristina Larsen, wrote in an 
email to Science. Dixson and two co-authors 
on the 2014 paper did not agree to the retrac- 


tion. UD says it will not comment on Dixson’s 
future there. 

Dixson is known as a highly successful sci- 
entist and fundraiser. She obtained her Ph.D. 
at James Cook University (JCU), Townsville, 
in Australia in 2012; worked as a postdoc and 
assistant professor at the Georgia Institute of 
Technology for 4 years; and in 2015 started 
her own group at UD’s marine biology lab in 
Lewes, a small town on the Atlantic Coast. 

Together with one of her Ph.D. supervi- 
sors, JCU marine biologist Philip Munday, 
Dixson pioneered research into the effects 
on fish of rising carbon dioxide levels in 
the atmosphere, which cause the oceans 
to acidify. In a series of studies published 
since 2009 they showed acidification can 
disorient fish, lead them to swim toward 
chemical cues emitted by predators, and af- 
fect their hearing and vision. Dixson’s later 
work focused on coral reef ecology. 

But a group of other marine biologists had 
long questioned the very big effect sizes and 
unusually small variances in many of Dixson 
and Munday’s results. In a 2020 Nature pa- 
per, they reported being unable to reproduce 
several of the claims. Later, four of them, 
along with former members of Dixson’s UD 
lab, decided to ask for a misconduct investi- 
gation (Science, 7 May 2021, p. 560). 

In the draft report, UD’s investigative 
committee says Dixson simply did not have 
enough time to collect the vast amount of 
data described in the Science paper, which 
purported to show that overfished, seaweed- 
covered reefs in Fiji fail to attract juvenile 
individuals of 15 fish and three coral spe- 
cies. The draft report also confirms claims 


Brit - ‘ PS 


Danielle Dixson studied coral reefs in Fiji for her 2014 study in Science, which has now been retracted. 
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by the whistleblowers and independent sta- 
tistical experts that a large Excel file of the 
study’s raw data was riddled with inexplica- 
bly duplicated columns. 

One of the paper’s two co-authors, Geor- 
gia Tech marine ecologist Mark Hay, was 
Dixson’s supervisor at the time. He says 
he wasn’t in Fiji during the study. But 
Dixson “was among the most hard-working 
and persistent colleagues I’ve worked with,” 
and the amount of data she reported “did not 
seem unreasonable given Dr. Dixson’s time in 
Fiji and strong work ethic,” he says. Hay says 
he has not seen the panel’s report, but “if UD 
found fault with her studies, I’m surprised.” 
Dixson “seemed honest and genuine.” Hay 
says he was interviewed by a separate in- 
vestigative committee at Georgia Tech but 
does not know the status or outcome of that 
investigation. A Georgia Tech spokesperson 
did not respond to a request for comment. 

Dixson and Hay published another paper 
in Science in 2012 showing corals can “re- 
cruit” plant-eating fish to trim toxic seaweeds. 
“We have been discussing the 2012 paper but 
were not given any information about it by 
the university,’ says Holden Thorp, Science’s 
editor-in-chief, “so we will begin 
our usual process to determine 
whether retraction or correc- 
tion is warranted.” 

The draft report also found 
misconduct in a 2016 paper 
on whether anemone fishes 
can sniff out the condition of 
potential host anemones, pub- 
lished in the Proceedings of 
the Royal Society B by Dixson 
and marine biologist Anna 
Scott of Southern Cross University in Aus- 
tralia. Again, the timeline was implausible, 
the committee concluded. Collecting the 
data would have taken 22 working days of 
12 hours, it wrote, “working continuously 
without any breaks.” Yet the paper said the 
study was done in 13 days, between 12 and 
24 November 2014. 

Scott and Dixson posted a correction to 
the Proceedings B paper in early July, stat- 
ing the studies actually took place between 
5 October and 7 November 2014, adding 
20 days to the timeline. They also uploaded 
the raw data for the study, which had been 
missing. But the whistleblowers say that file 
also has duplications and other problems. 

Scott tells Sctence she signed off on the 
correction. She did not respond to ques- 
tions about UD’s misconduct findings or the 
alleged problems in the data set. Spencer 
Barrett, editor-in-chief of Proceedings B, 
says UD has asked for a retraction of the 
paper after the correction ran, and that 
an investigative panel at the journal will 
now reconsider the issue. The journal is 


of Science an 
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still investigating another paper by Dixson, 
Barrett says. 

The UD committee noted that a 2014 pa- 
per in Nature Climate Change co-authored 
by Dixson and Munday also “had serious 
issues regarding the datasheet files,” which 
it called “signatures of fabrication and fal- 
sification of data.” But it’s unclear from the 
redacted draft report whether the commit- 
tee thinks that paper should be retracted. 
On another paper co-authored by Munday, 
in Animal Behaviour in 2012, the panel 
notes “extreme effect sizes and low to zero 
variance” in the data, but says, because 
no notebooks or files exist from the study, 
“there is insufficient evidence to support 
a finding of research misconduct.” The 
same is true for a 2009 Ecology Letters pa- 
per that Munday co-authored. Munday, in 
an email to Science, says he is “shocked to 
hear the outcome of the investigation” but 
declined to elaborate. 

The outcome is vindication for the group 
of whistleblowers, whose work had sharply 
divided marine ecologists, with some scien- 
tists accusing them of acting out of profes- 


The University 
of Delaware 


“did a decent 


investigation.” 


Fredrik Jutfelt, 
Norwegian University 


sional envy or to advance their own careers. 
UD “did a decent investigation. 
I think it’s one of the first uni- 
versities that we’ve seen actu- 
ally do that,” says one of the 
whistleblowers, Fredrik Jutfelt 
of the Norwegian University 
of Science and Technology. 
But he is disappointed that 
the committee appears to only 
have looked at seven of the 
20 papers that the team flagged 
as suspicious. The group had 
also hoped UD would release the commit- 
tee’s final report and detail any sanctions 
against Dixson. 

One of the whistleblowers, former Ph.D. 
student Paul Leingang, worked at Dixson’s 
lab but started to secretly monitor her work 
after he became increasingly suspicious. 
Leingang, who left UD in February 2021 after 
formally accusing Dixson of misconduct— 
and has since left science altogether— 
provided the investigation with detailed 
notes, chat conversations, and other evi- 
dence to show she did not spend enough 
time on her studies to collect the data she 
was jotting down in her lab notebooks. 

The investigative panel found Leingang’s 
account convincing and singled him out for 
praise. “It is very difficult for a young scholar 
seeking a Ph.D. to challenge their advisor on 
ethical grounds,” the draft report says. “The 
Committee believes it took great bravery for 
him to come forward so explicitly.” 


d Technology 
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ASTRONOMY 


Webb reveals 
early universe’s 
galactic bounty 


Star formation after the big 
bang appears much faster 
than models had forecast 


By Daniel Clery 


he James Webb Space Telescope has 

only been watching the sky for a few 

weeks, and it has already delivered 

a startling finding: tens, hundreds, 

maybe even 1000 times more bright 

galaxies in the early universe than 
astronomers anticipated. 

“No one was expecting anything like this,” 
says Michael Boylan-Kolchin of the Univer- 
sity of Texas, Austin. “Galaxies are exploding 
out of the woodwork,’ says Rachel Somerville 
of the Flatiron Institute. 

Galaxy formation models may now need a 
revision, as current ones hold that gas clouds 
should be far slower to coalesce into stars 
and galaxies than is suggested by Webb’s 
galaxy-rich images of the early universe, less 
than 500 million years after the big bang. 
“This is way outside the box of what models 
were predicting,’ says Garth Illingworth of 
the University of California (UC), Santa Cruz. 

Webb, a NASA-led orbiting observatory with 
contributions from the European and Cana- 
dian space agencies, began observing in late 
June from its vantage point 1.5 million kilo- 
meters from Earth. Much of its time so far 
has been devoted to projects meant to show 
off its capabilities, such as the Cosmic Evolu- 
tion Early Release Science (CEERS) Survey. 
Webb is designed to delve deeper into cos- 
mic history than its predecessor, the Hubble 
Space Telescope. Its 6.5-meter mirror—with 
six times the area of Hubble’s—can catch 
more light from distant sources, and unlike 
Hubble it operates at infrared wavelengths, 
making Webb more sensitive to those far- 
away sources, whose light is stretched to lon- 
ger, redder wavelengths by cosmic expansion. 

Within days after Webb began observa- 
tions, it spotted a candidate galaxy that ap- 
pears to have been shining brightly when the 
universe was just 230 million years old, 1.7% 
of its current age, which would make it the 
most distant ever seen. Surveys since then 
have shown that object is just one of a stun- 
ning profusion of early galaxies, each small 
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by today’s standards, but more luminous 
than astronomers had expected. 

Some researchers caution that the abun- 
dance, based on images of a small patch of 
sky, may be an illusion. Boylan-Kolchin won- 
ders whether Webb just got “extra lucky” and 
stared into a huge clump of galaxies, denser 
than the rest of the early universe. That ques- 
tion will be resolved when CEERS broadens 
its scope later this year and results come in 
from other wide-ranging surveys. 

It is also possible that astronomers are 
misidentifying galaxies from slightly more 
recent times as very early ones. Spectra are 
the gold standard for gauging a galaxy’s age 
because they allow the reddening of its light 
to be measured precisely. 
But gathering spectra from 
many galaxies takes time. 
Instead, Webb surveys so 
far have estimated galaxy 
ages from the color they ap- 
pear in images—a relatively 
crude method. Webb’s near- 
infrared camera filters their 
light into a few wide wave- 
length bins, giving astrono- 
mers a rough measurement 
of color; redder equals more 
distant. But dust surround- 
ing a galaxy can fool observ- 
ers, as it can absorb starlight and re-emit 
it at longer wavelengths, making the galaxy 
look redder. 

Webb’s early science teams have al- 
ready identified a few such masquerading 
galaxies, as they report in several recent 
preprints. But if the profusion of early 
galaxies is real, astronomers may have to 
fundamentally rethink galaxy formation or 
the reigning cosmology. 

Viewing nearby galaxies, researchers 
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The James Webb Space 
Telescope captured this galaxy 
at a record distance. 


have concluded that heat within gas clouds 
slows how quickly gravity would otherwise 
condense the matter into stars—making 
star formation take about 100 times longer 
than if gravity alone was in charge. As the 
first stars in a protogalaxy begin to shine, 
they inject more heat into the gas, pumping 
the brakes on further star formation. And 
the first stars are short-lived giants; when 
they explode as supernovae they heat up 
gas clouds even more or blast them out of a 
forming galaxy completely. 

Studies with Hubble have shown that the 
rate of star formation has been relatively 
constant as far back as about 600 million 
years after the big bang, says Charlotte 
Mason of the Niels Bohr In- 
stitute. But the Webb results 
imply that at earlier times 
its pace was much more 
rapid—as fast, Somerville 
suggests, as if gas clouds 
were collapsing freely, with- 
out any braking from heat 
or supernovae. 

Indeed, Tommaso Treu of 
UC Los Angeles, who leads 
another Webb survey called 
GLASS, says his team is 
seeing these early galaxies 
“form stars like crazy.’ They 
look, he adds, “like giant balls of star for- 
mation and nothing else.” 

Theorists don’t know if the higher den- 
sity of matter and higher temperatures of 
the early universe might have sped star 
formation. Another theory is that the first 
stars could have formed faster because they 
took shape from just the primordial matter 
leftover from the big bang—hydrogen and 
helium—without the heavier elements forged 
by later generations of stars. 


Members of the Cosmic 
Evolution Early Release Science 
Survey team analyze a 

recent galaxy-strewn James 
Webb Space Telescope image. 


Or something may be wrong in the current 
understanding of how the universe evolves. 
The prevailing theory of cosmology, known 
as lambda-CDM (referring to cold dark mat- 
ter), describes how, soon after the big bang, 
the unseen dark matter that makes up most 
of the stuff of the universe clumped together 
under its own gravity into “halos.” These ha- 
los then drew in normal matter and created 
the conditions for it to condense into galax- 
ies. Lambda-CDM predicts the number and 
size of halos that should exist in the early 
universe, and hence the number of galaxies. 
“There’s not much wiggle room,’ Boylan- 
Kolchin says. 

Somerville says it may be possible to 
tweak lambda-CDM to create something 
closer to what Webb is seeing. Or, she says, 
cosmo-logists may be forced to reassess the 
first moments of the big bang itself: the era 
of inflation, a period of rapid growth when 
quantum fluctuations grew into areas of 
higher or lower matter density—the seeds 
of later halos. “If inflation is wrong that 
could be very fundamental,” she says. “But 
I wouldn’t bet on it being that.” 

Having revealed the early galaxies prob- 
lem, Webb may provide the data needed to 
answer it. So far Webb is only seeing young, 
hot, bright stars in the newfound early gal- 
axies. Follow-up observations of these gal- 
axies at longer wavelengths with Webb’s 
midinfrared instrument or ground-based 
radio telescopes sensitive to submillimeter 
waves could reveal the gas clouds actively 
building stars. Those observations might 
help astronomers confirm that early galax- 
ies were unusually prodigious star facto- 
ries—and hold clues to how they did so. 

“In 6 months we'll have a much better 
picture of all this,’ Boylan-Kolchin says. “It’s 
a very exciting time.” 
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Starship will be the 
biggest rocket ever. Are 
space scientists ready 
to take advantage of it? 


By Sarah Scoles 


ASA’s Lunar Crater Observation 
and Sensing Satellite mission was 
brutish and short. It began on 
9 October 2009, when the hull 
of a spent Centaur rocket stage 
smashed into Cabeus crater, near 
the south pole of the Moon, with 
the force of about 2 tons of TNT. 
And it ended minutes later, when 
a trailing spacecraft flew through and ana- 
lyzed the lofted plume of debris before it, 
too, crashed. About 6% of the plume was 
water, presumably from ice trapped in the 
shadowed depths of the crater, where the 
temperature never rises above -173°C. The 
Moon, it turned out, wasn’t as bone dry as 
the Apollo astronauts believed. “That was 
our first ground truth that there is water ice,” 
says Jennifer Heldmann, a planetary scien- 
tist at NASA’s Ames Research Center who 
worked on the mission. 

Today, Heldmann wants to send another 
rocket to probe lunar ice—but not on a 
one-way trip. She has her eye on Starship, 
a behemoth under development by private 
rocket company SpaceX that would be the 
largest flying object the world has ever 
seen. With Starship, Heldmann could send 
100 tons to the Moon, more than twice the 
lunar payload of the Saturn V, the work- 
horse of the Apollo missions. She dreams of 
delivering robotic excavators and drills and 
retrieving ice in freezers onboard Starship, 
which could return to Earth with tens of 
tons of cargo. By analyzing characteristics 


SpaceX expects to launch the 120-meter-tall Starship 
(left) on its first orbital test flight in the coming 
months. In 2021, workers stacked a Starship upper 
stage on a Super Heavy booster (right). 


science.org SCIENCE 


PHOTO: SPACEX 


NEWS 


PHOTO: SPACEX 


such as the ice’s isotopic composition and 
its depth, she could learn about its origin: 
how much of it came from a bombardment 
of comets and asteroids billions of years 
ago versus slow, steady implantation by the 
solar wind. She could also find out where 
the ice is abundant and pure enough to sup- 
port human outposts. “It’s high-priority sci- 
ence, and it’s also critical for exploration,” 
Heldmann says. 

When SpaceX CEO Elon Musk talks up 
Starship, it’s mostly about human explo- 
ration: Set up bases on Mars and make 
humans a multiplanetary species! Save civi- 
lization from extinction! But Heldmann and 
many others believe the heavy lifter could 
also radically change the way space scien- 
tists work. They could fly bigger and heavier 
instruments more often—and much more 
cheaply, if SpaceX’s projections of cargo 
launch costs as low as $10 per kilogram are 
to be believed. On Mars, they could deploy 
rovers not as one-offs, but in herds. Space 
telescopes could grow, and fleets of satellites 
in low-Earth orbit could become common- 
place. Astronomy, planetary science, and 
Earth observation could all boldly go, better 
than they ever have before. 

Of course, Starship isn’t real yet. All eyes 
will be on a first orbital launch test, expected 
sometime in the coming months. Even if it is 
a success, no one knows whether SpaceX will 
be able to achieve its vision of launching the 
rockets daily and reusing them many times. 
Also unsettled is whether a market will ma- 
terialize for a rocket that could put so much 
into orbit. But scientists need to prepare, 
Heldmann says. “We on the science side need 
to be ready to take advantage of those capa- 
bilities when they come online.” 


SCIENCE science.org 


So do NASA centers such as the Jet Pro- 
pulsion Laboratory, which designs and 
builds many space science missions, says 
Casey Handmer, a former JPL software en- 
gineer. In a series of provocative blog posts 
with titles like “Starship is still not under- 
stood,” he has argued that Starship will upset 
the traditional way of doing space science— 
spending billions of dollars to make one-of- 
a-kind instruments that work perfectly. If the 
NASA centers don’t find ways to take risks 
and make more stuff more cheaply, he says, 
they will find themselves displaced by com- 
panies willing to do so. “The writing is on 
the wall?’ Handmer says. “And all the NASA 
centers should be thinking really carefully.” 


ON A BALMY NIGHT in February, Musk strode 
onto a stage in Boca Chica, Texas, home of 
SpaceX’s Starbase launch site, for a public 
update on the status of Starship. Towering 
behind him, bathed in lights, was the latest 
prototype, about 120 meters tall: the Starship 
vehicle, which carries people or payloads, 
resting on top of a Super Heavy booster. 
The prototype wasn’t flight ready, nor had 
the Federal Aviation Administration (FAA) 
given SpaceX permission to launch it from 
Starbase—but it was still a spectacular back- 
drop, packed with coiled purpose. After wel- 
coming the crowd of faithful rocket geeks, 
Musk launched into an impromptu lecture 
on the philosophy propelling him and his 
company beyond Earth. “Why build a gi- 
ant, reusable rocket? Why make life multi- 
planetary? I think this is an incredibly impor- 
tant thing for the future of life itself” 
SpaceX’s workhorse rocket, the 70-meter- 
tall Falcon 9, has already shaken up the aero- 
space business. With that rocket, SpaceX 


pioneered reusability, employing retrorockets 
and steerable fins to guide the first stage to a 
landing after it re-enters the atmosphere. To- 
day, SpaceX routinely slaps on a fresh coat of 
paint and launches it again; in June, the com- 
pany flew one of these “flight tested” stages 
a record 13th time. Another record is on the 
horizon: The company is on track to launch 
more than 50 Falcon 9 and Falcon Heavy 
rockets this year, or about one per week on 
average. The dependable reuse and rapid 
launch cadence are two of the reasons why 
SpaceX can charge $67 million for a Falcon 
9 launch, much less than its competitors. But 
Musk wasn’t satisfied. 

In 2016, at an International Astronauti- 
cal Congress in Mexico, Musk sketched out 
plans for a rocket to colonize Mars, one 
he would soon be calling BFR (Big Falcon 
Rocket, in family-friendly terms, but you get 
the joke). The concept evolved into Starship, 
but the focus remained on affordability and 
reusability—making launches as dull and 
routine as FedEx cargo flights. The body of 
the rocket is stainless steel, heavier than the 
aluminum alloys of most rockets, but cheaper 
and more easily manufactured. The 33 Rap- 
tor engines crammed into the backend of 
Super Heavy burn methane rather than the 
traditional kerosene-based rocket fuels, not 
only because it is cheaper, but also because 
it could be harvested on Mars by combin- 
ing carbon dioxide and water. The booster is 
designed to return to the launchpad after its 
6-minute ride; the company believes it can 
be refueled and ready to relaunch in an hour. 
Starship is also reusable. The goal is to be 
able to launch each vehicle three times a day. 

Once in orbit, a loaded Starship could be 
gassed up by a “tanker” version of the vehicle— 
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enabling it to take its 100 tons of payload 
on to the Moon or Mars. At the February 
event, Musk explained how a single Starship, 
launching three times per week, would loft 
more than 15,000 tons to orbit in a year— 
about as much as all the cargo that has been 
lifted in the entire history of spaceflight. Musk 
has claimed the price of each launch might 
eventually be as low as $1 million, or $10 per 
kilogram to low-Earth orbit. The only rocket 
close to Starship in its capabilities is NASA’s 
Space Launch System, set to fly for the first 
time this month. Earlier this year, the agency’s 
auditor found each launch would cost about 
$4 billion, or nearly $60,000 per kilogram. 

Pierre Lionnet, a space economist at Euro- 
space, an industry trade group, is skeptical 
SpaceX can achieve such a low price point. 
It may not correctly account for the costs of 
developing and building the rocket, for ex- 
ample. “When I look at Starship, ’m looking 
at what seems to be a very expensive device.” 
To achieve profitability with such high capital 
costs, SpaceX will have to attain its ambitious 
launch rates, which means it will need pay- 
ing customers to soak up all that cargo ca- 
pacity. SpaceX hopes to develop new markets 
in space mining, tourism, or other activities 
not yet dreamed of, but Lionnet is not so sure 
the heavy lifter will whet that appetite all by 
itself. “If you're vegetarian, and I’m offering 
you a burger, I can offer it at the cheapest 
possible price, and you don’t eat it.” 

The debate will soon graduate beyond 
the theoretical. In May 2021, after several 
spectacularly explosive failures, a Starship 
upper stage flew 10 kilometers up into the 
atmosphere. After landing, it briefly caught 
fire, but the company deemed the suborbital 
flight a success. Since then, SpaceX has built 
out Starbase, constructing a launch tower 
that can catch returning boosters with two 
robotic arms the company calls “chopsticks.” 
It has refined its rocket assembly line, which 
can now build four Raptor engines per week. 
And in June, FAA gave SpaceX approval to 
launch from Starbase, provided it takes steps 
to minimize the impact on the environment. 

At the February event, Musk said he was 
confident Starship would make its first or- 
bital attempt this year. For Musk, the sci-fi 
dreams are tantalizingly within reach. “Let’s 
make this real!” he exhorted the crowd, 
pumping his fists. 


SCIENCE HAS MOSTLY been an afterthought 
for Musk. But Heldmann has been surprised 
that, for many planetary scientists, Starship 
has also been an afterthought. 

In 2020, she and a team of researchers 
and industry insiders submitted a white pa- 
per touting the benefits of Starship to the 
“decadal survey” in planetary science, an 
influential community exercise that helps 
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Heavy lifters 

Starship would be the largest rocket the 
world has ever seen. It could also 

offer the cheapest launches. To achieve 
that, SpaceX will need to launch fleets 

of Starships daily—and it will need 
customers for the immense cargo capacity. 


SPACE SHUTTLE 
Debut 1931 
Height 56m 
Cost/kg* $65,400 


FALCON 9 
Debut 2010 
Height 70 m 
Cost/kg $2600 


SPACE LAUNCH 
SYSTEM (BLOCK 1) 
Debut 2022 

Height 98 m 


(ot 


STARSHIP 
Debut 2022 
Height 120m 
Cost/kg $10 


SATURN V 
Debut 1967 
Height 111m 
Cost/kg $5400 


Cost/kg $58,000 


*Cost per kilogram to low-Earth orbit in 2021 dollars 


Long hauler 
By refueling in orbit, each Starship can carry its full 100-ton payload to Mars. On Mars, water, 
carbon dioxide, and electricity can be used to make methane fuel. 


3 Tankers 
refuel ship and 


1 Booster lifts 
ship and returns 
to launch pad. 


2 Ship 
reaches 
Earth orbit. 


NASA and Congress set long-term priori- 
ties. “It’s a good time to try and get this idea 
in the consciousness of others,’ she says. 
Heldmann and her colleagues suggested 
NASA create a dedicated funding line for 
missions relying on Starship. 

The survey embraced the ideas. In its 
April report, the survey committee explic- 
itly mentioned Starship and cited ideas in 
Heldmann’s paper. The committee recom- 
mended a funding line relevant to Starship’s 
specs and said NASA should plan to capital- 
ize on the rocket’s potential. “Both cargo and 
crew flights to Mars offer significant potential 


return to Earth. 


4 Refueled 
ship travels 
to Mars. 


5 Ship refueled 
using martian 
resources. 


6 Ship 
returns directly 
to Earth. 


science opportunities,’ the committee said. 
The benefits wouldn’t be limited to the 
Moon and Mars, points out Daniel Baker, di- 
rector of the Laboratory for Atmospheric and 
Space Physics at the University of Colorado, 
Boulder. For instance, Mercury, with its weak 
gravity, has been a tough target because of 
the extra fuel required to slow a spacecraft 
enough to get into orbit; the Sun’s heat is an- 
other problem. But Starship wouldn’t mind 
the big gas tank or the sunshade needed to 
keep the spacecraft from melting. Baker 
also envisions faster missions to the outer 
planets that don’t require time-consuming 
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gravitational assists from other planets. Even 
farther afield, Interstellar Probe, a proposed 
mission to follow in the footsteps of NASA's 
famed Voyager mission (Science, 29 July, 
p. 460), could carry more capable instru- 
ments aboard Starship—and get a faster ride 
to interstellar space. 

Some astronomers also have Starship in 
their eyes. “There’s no way to talk about it 
without resorting to cliches, but ‘best rocket 
engine ever; probably, by most metrics,” says 
David Rubin, a cosmologist at the University 
of Hawaii, Manoa. He wonders how much 
simpler the $10 billion James Webb Space 
Telescope (JWST) might have been if its 
6.5-meter-wide segmented mirror hadn’t had 
to fold up to fit on its rocket. Engineers could 
have built a monolithic mirror and launched 
it as is within the 9-meter-wide Starship fair- 
ing, which encloses a volume about 
half as big as a hot air balloon. 

Rubin also dreams of using 
Starship to construct a_ giant 
telescope—say 30 meters—in space. 
Limbed robots could precisely lay 
down mirror segments on a scaf- 
folding, forming a giant mirror that 
could pick out the universe’s first galaxies 
and look for signs of life in the atmospheres 
of Earth-like exoplanets. “The science gains 
scale really quickly as you build larger and 
larger telescopes,’ Rubin says. 

Not all astronomers are fans of SpaceX, 
which has already launched nearly 3000 of 
its Starlink internet satellites into low or- 
bits, where sunlight glinting off them leaves 
streaks on the cameras of ground-based tele- 
scopes. The problems could multiply with 
Starship, which could launch hundreds of 
Starlinks at a time, enabling the company 
to build its planned constellation of up to 
42,000 satellites even faster. SpaceX now 
equips the satellites with “sunshades” to re- 
duce the reflective glare, but astronomers 
are still worried. “Making access to space 
and Earth orbit easier has a lot of benefits,” 
says Meredith Rawls, an astronomer at the 
University of Washington, Seattle, and a 
member of an International Astronomical 
Union center set up to mitigate satellite in- 
terference. “But we need to make sure that 
were doing it in a mindful way and not just 
having it be a Wild West disaster.” 

As another reality check for the dreamers, 
Lionnet points out that discounted rides will 
only reduce the cost of missions by so much. 
For major scientific projects, Lionnet says, 
launch costs are usually between just 5% and 
10% of the total price tag. For JWST, the frac- 
tion was even smaller. The typical cost for a 
ride on an Ariane 5 rocket, JWST’s launcher, 
is about $175 million, just 2% of the mission’s 
total price tag. “A complex telescope will still 
be a complex telescope,” Lionnet says. 
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But cheaper launches could allow the 
probes themselves to be cheaper, with less 
need for space-rated parts that save on 
weight or bulk. With Starship, planetary 
scientists wanting to outfit a rover with a 
spectrometer could just buy one on the in- 
ternet. Astronomers could use a glass mirror 
instead of a featherweight beryllium one, 
like JWST’s. And, Rubin says, “You should 
just be able to shield your way into radiation 
hardness,” rather than soldering circuits out 
of specialized materials. 

Earth-observation researchers have al- 
ready been down that road. For many 
years, remote sensing satellites were big 
and pricey—little different from JWST, says 
Aravind Ravichandran, founder of the space- 
industry consultancy TerraWatch Space. But 
about a decade ago came the “small satel- 


“If thousands and thousands of tons are 
going to orbit, someone's going to figure out how 
to put a telescope up there without NASA.” 


David Rubin, University of Hawaii, Manoa 


lite” revolution: Researchers shrank and 
standardized equipment and took advantage 
of rideshares on relatively cheap Falcon 9 
launches and other small rockets. 

Suddenly, university students were send- 
ing shoebox-size CubeSats to space. Using 
cheap cameras and consumer electronics, 
the company Planet built up a fleet of about 
200 small satellites that gather daily images 
of all of Earth’s land. Ravichandran sees Star- 
ship making it easier to assemble ever bigger 
fleets—enough eyes on the planet to revisit 
a given spot multiple times an hour, rather 
than every few hours, days, or weeks. “Why 
can’t you do every 5 minutes? Every 10 min- 
utes?” Ravichandran asks. Imagine, he says, 
what that kind of revisit rate might do for, 
say, tracking wildfires or floods. 


HANDMER, WHO NOW WORKS as a clean- 
energy entrepreneur, wants astronomers 
and planetary scientists to adopt this sort 
of bold thinking. Instead of a 30-meter tele- 
scope, why not a 1000-meter one? Why not 
mass-produce probes that could survey doz- 
ens of asteroids? Why not fly by all the outer 
planets in the next decade? Or land on most 
planets annually? 

In Handmer’s view, the problem is partly 
cultural: NASA engineers try to get every- 
thing right on the first try, at all costs—the 
vastly expensive, long-delayed JWST being 
a prime example. “It’s kind of like a medi- 
eval cathedral,” he says, of such flagship mis- 
sions. To exploit Starship’s immense capacity, 
Handmer estimates NASA will need to make 
100 times as much stuff for a fraction of the 


usual cost. It will need to be a fast-fashion 
factory, not a boutique. But having worked at 
JPL, Handmer isn’t necessarily hopeful that 
will happen. “It was just not set up to mass- 
produce anything,” he says. 

Rubin says NASA centers could get left 
behind by nimbler companies, or privately 
funded scientists. “If thousands and thou- 
sands of tons are going to orbit, someone’s 
going to figure out how to put a telescope 
up there without NASA,” he says. Astro- 
botic, which calls itself a “lunar logistics 
company,’ has sprung up with this sort of 
business model. It plans to send landers 
and rovers to the Moon, carrying instru- 
ments for paying customers. 

Robert Manning, JPL’s chief engineer, 
doesn’t think the facility is quite so resistant 
to change. But he also questions Handmer’s 
vision of cheap, mass-produced 
probes. The equipment used at the 
scientific frontier is rarely stan- 
dard. Every mission, with its fresh 
targets and questions, requires in- 
novation. And it’s hard to take risks 
as a public agency, Manning says. 
“We have an obligation to make 
sure that we are not wasteful of taxpayers’ 
dollars,” he says. “We can’t throw things to 
space frivolously and say, ‘Well, if it doesn’t 
work, let’s build another one.” 

In 1992, then-NASA Administrator 
Daniel Goldin pushed the agency to pur- 
sue a “faster, better, cheaper” approach— 
a mantra that was discarded later in the 
decade after high-profile losses of the Mars 
Climate Orbiter and Mars Polar Lander. If 
a faster, better, cheaper culture is to return, 
Manning says both NASA and Congress 
will have to bless the risk-taking—and 
stand up for it when things go wrong. “It’s 
going to be difficult to politically commu- 
nicate that that’s OK for us to try it out,” 
Manning says. 

But assuming all those issues worked 
themselves out, he acknowledges that 
frequent, low-cost, high-mass-and-volume 
launches could provide “an incredible op- 
portunity for us to change how we get things 
done and be willing to take more risks.” JPL 
has already been thinking about it, he says— 
for example, considering how to incorporate 
standardized, lower cost components in 
NASA’s custom deep-space missions. 

There’s a major asterisk on the rocket 
revolution that Starship heralds. “We haven't 
been able to act on it yet,’ Manning says, 
“because it’s not true yet.” When Starship 
makes its first orbital launch attempt, many 
researchers will be watching, waiting to see 
whether that giant silvery rocket is a vision 
of space science’s future, or a mirage. 


Sarah Scoles is a journalist in Denver. 


12 AUGUST 2022 * VOL 377 ISSUE 6607. 705 


When less is more in the evolution of language 


Did loss of vocal fold membranes typical of nonhuman primates enable human speech? 


By Harold Gouzoules 


ost studies of the biological evolu- 
tion of language address human 
features, both cognitive and neuro- 
logical, that appear absent, or less 
well developed, in nonhuman spe- 
cies. A picture emerges of what has 
been added to the broader vocal and com- 
municative abilities of nonhuman primates 
to produce language. For example, insights 
into the linguistic capabilities and limita- 
tions of the great apes came from projects in 
which primate subjects were tutored to com- 
municate with humans using either a form of 
sign language, physical tokens, or lexigrams. 
Extant apes appear to have most of the cog- 
nitive skills for language but lack shared in- 
tentionality and a capacity for speech (7). On 
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page 760 of this issue, Nishimura et al. (2) 
provide evidence that loss of vocal fold mem- 
branes that are universally present in extant 
nonhuman primates resulted in a stable 
vocal source in humans that was critical to 
speech evolution. 

Speech and language are critically re- 
lated, but not synonymous. Speech is the 
audible sound-based manner of language 
expression and humans, alone among the 
primates, can produce it. Sound produc- 
tion mechanisms in human and nonhu- 
man primates are similar, with air from 
the lungs driving oscillations of vocal folds 
in the larynx. The acoustical energy gener- 
ated then passes through the pharyngeal, 
oral, and nasal cavities, and emerges in 
a form that is governed by the filtering 
of specific frequencies by the vocal tract. 
This filtering mechanism is important to 
speech, generating the formant frequency 
patterns that convey the sounds of lan- 


guage. However, this aspect of vocalization 
production is not distinct to humans, and 
all mammals appear to perceive formants 
in conspecific calls (3). 

Identifying the adaptations that enabled 
speech has been challenging, and many 
unanswered questions remain. Notably, 
the core findings of Nishimura et al. are 
not even hinted at in earlier studies of the 
evolution of speech (3, 4). Indeed, most re- 
cent research attention has focused away 
from the peripheral structural dimensions 
of speech production to central neural con- 
trol mechanisms (3, 4). Peripheral features 
that had received earlier interest included 
hypothesized speech-related specializations 
of the human vocal tract, in particular the 
descent of the tongue root into the pharynx. 
For years this feature was considered a dis- 
tinctly human trait associated with speech 
evolution because it was thought to provide 
a greatly expanded phonetic range relative 
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Nonhuman primates, such as this chimpanzee 
(Pan troglodytes), possess vocal fold membranes 
that were lost in the evolution of human speech. 


to other primates (5). But it turns out that 
the vocal range of nonhuman primates had 
been previously underestimated, and the 
descended larynx is not distinctly human 
(3). Thus, the descended larynx is clearly 
not sufficient for speech evolution, but it 
might have been a necessary step. 

The study of Nishimura et al. refocuses 
attention on the peripheral adaptations for 
speech. Using magnetic resonance imaging 
(MRI) and computed tomography (CT), the 
authors examined larynges from 29 genera 
and 44 species of primates and found that 
all the nonhuman taxa possessed a vocal 
membrane that is absent in humans. They 
also documented membrane activity during 
vocalizations in chimpanzees, squirrel mon- 
keys, and rhesus macaques and used math- 
ematical modeling to compare the acoustic 
effects of nonhuman vocal membrane vi- 
bration to that of vocal fold vibration alone, 
which is characteristic of humans. From 
this evidence, they propose that selection 
for the elimination of the laryngeal air sacs 
and vocal fold membranes (thin upward ex- 
tensions of the vocal folds, or “vocal cords,” 
that are universally found in the larynx of 
nonhuman primates) was critical to speech 
evolution because this loss resulted in a 
stable vocal source. 

Although the vocal repertoires of many 
nonhuman primate species include some 
clear, harmonically structured calls, their 
vocalizations frequently bifurcate to sub- 
harmonics or chaos at higher intensities. 
The vocal membrane that has been lost in 
humans is implicated in generating these 
nonlinear phenomena. The loss of the vo- 
cal membrane in humans therefore reduces 
the risk of chaotic irregularities and noise 
contaminating the stable vocal fold oscil- 
lations that are critical to human speech. 
According to the authors, this stability, 
combined with fine neural control involving 
extensive dorsal intracortical connections 
between motor and auditory cortices, yields 
the quality of vocal signal needed to rapidly 
encode complex linguistic information. 

What makes this study especially note- 
worthy is that the authors conclude that the 
human larynx differs from that of extant 
nonhuman primates in that it has become 
simpler or less complex with the loss of the 
membranes. Evolutionary biology textbooks 
emphasize that natural selection does not 
inevitably produce greater complexity. For 
example, the Mexican cavefish (Astyanax 
mexicanus) no longer has eyes, and there 
is genetic, developmental, and physiologi- 
cal evidence for adaptive eye regression in 
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this species and other cave animals (6). In 
essence, the cave dwellers represent a “use 
it or lose it” evolutionary scenario. But what 
did humans lose with the loss of their vocal 
membranes (and air sacs)? 

In nonhuman species, including pri- 
mates, nonlinear vocal phenomena (NLP) 
are prominent features of communicative 
calls. NLP vocalizations sound harsh and 
have been hypothesized to lower perceived 
voice pitch, exaggerating the body size of 
the vocalizer to the listener (7). This sug- 
gests that selection can favor anatomical 
structures and physiological mechanisms 
that promote vocalization instability and 
support complex subharmonics and chaos. 
Although the descended larynx in humans 
likely offers some compensation with re- 
spect to lowering pitch, the loss of vocal 
membranes and the concomitant abate- 
ment of NLP would seem to reduce this di- 
mension of human nonverbal vocal commu- 
nication. Despite the absence of vocal fold 
membranes, humans retain the ability to 
produce vocalizations (often intentionally 
and voluntarily) with nonlinear and chaotic 
features, such as screams. Indeed, human 
screams are evolutionarily conserved acous- 
tically and resemble those of other mamma- 
lian species (8, 9). Humans perceive and re- 
spond to NLP in vocalizations in ways that 
are consistent with animal communication 
studies. NLP lowers perceived voice pitch, 
increases acoustic roughness, and causes 
vocalizers to sound larger, more formidable, 
and more aggressive (10). With the loss of 
vocal fold membranes, humans have ap- 
parently reduced the production of NLP in 
speech, but still retain the ability to gener- 
ate harsh sounds in nonverbal vocalizations 
such as screams and roars, a complexity of 
vocal control that has yet to be adequately 
investigated. If there has been a loss of a 
tendency or capacity to exaggerate size vo- 
cally in humans, there are other means to 
offset them: Humans can, of course, speak 
softly and carry a big stick. 
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MEDICAL DEVICES 


Ultra-sounding 
out a technique 
that sticks 


Ultrasound can be used to 
promote the physical 
interlocking of adhesives 
and tissues 


By Julien Es Sayed and Marleen Kamperman 


dvances in robotics, surgical instru- 
mentation, and real-time imaging in- 
side the body have made it possible to 
switch various surgeries to minimally 
invasive ones. However, for many 
surgical procedures that require re- 
connecting tissue or attaching prosthetic 
materials, a minimally invasive way to per- 
form them does not yet exist. Stapling and 
suturing are harsh techniques that can cause 
local tissue damage from deep piercing. In 
some cases, their removal after wound heal- 
ing can also inflict damage. Tissue adhesives 
could potentially reduce the harm of surgical 
procedures, decrease operative times, and 
improve the surgical outcomes. Although 


“Tissue adhesives could 
potentially reduce the harm of 
surgical procedures, decrease 
operative times, and improve 
the surgical outcomes.” 


promising approaches have been developed, 
tissue adhesives for wound closure are now 
mainly used in combination with sutures 
and staples. On page 751 of this issue, Ma et 
al. (1) introduce an ultrasound-mediated ap- 
proach to enhance the adhesive performance 
of soft hydrogels on various tissues. 
Researchers have long been searching for 
adhesives that can withstand the environ- 
ment of the human body (2). It is difficult to 
“glue” things together when they are wet and 
slippery. Water acts as a boundary layer be- 
tween adhesive and tissue, preventing bonds 
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from forming. Mucous membranes on the 
surfaces of internal organs, such as the gastro- 
intestinal tract and lungs, also act as a barrier 
for adhesives (3). Some tissues also change 
shape constantly, such as the beating heart or 
the breathing diaphragm, making adhesive- 
tissue bonding difficult to attain. One ap- 
proach to achieve adhesion is to chemically 
modify the adhesive with functional groups 
such as amines, carboxylic acids, and thiols. 
Peptide coupling reactions can then create a 
strong bond between the modified adhesive 
and tissue proteins (4). Challenges in devel- 
oping such an adhesive include timing the 
chemical reaction in the time window of the 
surgical procedure; tuning the properties of 
the adhesive to the chemistry of the specific 
tissue; preventing inflamma- 
tion triggered by unreacted resi- 
dues, degradation, or reaction 
by-products; and ensuring that 
the generated adhesive joint can 
withstand stretching (5). Ma et 
al. address the latter challenge 
by increasing the toughness of 
the adhesive joint. 

Toughness of an adhesive 
joint is characterized by the 
energy needed to break it. The 
more an adhesive joint can 
stretch and absorb energy, the 


Chain penetration 
or interlocking 
Primer polymer 
chains are propelled 
into the tissue by 


higher pH (9). In this approach, water is no 
longer considered detrimental to adhesion 
but instead helps facilitate polymer diffu- 
sion into the tissue. Still, the relatively slow 
diffusion of polymer chains inside the tissue 
limits the speed of the adhesion process. 
Building on these considerations, Ma et 
al. advance the topological adhesion con- 
cept by using ultrasound to propel polymer 
chains deeper and faster into the biological 
tissue. These sound waves prepare biologi- 
cal tissue to interact with a hydrogel that is 
placed on top (see the figure). Before estab- 
lishing contact with the hydrogel, a natural 
polymer solution (e.g., chitosan, cellulose 
nanocrystals, or gelatin) is spread on the tis- 
sue, and the contact area is then treated with 


Ultrasound-enhanced topological 
adhesion to tissues 


Tough adhesion between tissues and hydrogels has been achieved by stitching 
them together with anchored dangling polymer chains. Ma et al. show that 
ultrasound treatment enhances the interlocking of the chains in the tissue. 
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ultrasound. The ultrasound creates cavitation 
microbubbles that propel the primer mole- 
cules into the tissue, resulting in strong me- 
chanical interlocking. In addition to this en- 
tanglement, the primer polymer chains also 
form either electrostatic, hydrophobic, or 
hydrogen bonds with the surrounding tissue. 
This combination of topological and physical 
affinity results in strong adhesion between 
hydrogel and tissue without the need for a 
chemical reaction. 

The concept of using ultrasound to en- 
hance the permeability of tissues has been 
successfully applied for drug delivery sys- 
tems (10), which may make the translational 
process to the clinic faster and easier. In the 
work of Ma et al., and that of many other re- 
search groups, it becomes clear 
that large variations between 
tissues lead to substantial dif- 
ferences in the effectiveness of 
adhesion strategies. Future re- 
search should focus on tissue- 
specific adhesive development. 
Better insight is needed into 
the mechanical characteristics 
and surface chemistry of differ- 
ent tissues, including damaged 
and diseased tissues, because 
this is where adhesives will be 
used most often. In addition, 


Primer 
solution 


harder it is to break. Stretching 
reduces the force per unit area 
and consequently decreases the 
likelihood that the adhesive 
joint will fail (6). An effective 
way to increase toughness is by 
interlocking polymer chains into 
the substrate (7). This design 
principle takes advantage of the 
roughness as well as the rela- 
tive permeability of biological 
tissues and hydrogels in an ap- 
proach called topological adhe- 
sion (8, 9). This technique relies 
on first diffusing polymer chains 
into the surface of the tissue 
substrate and then inducing the 
formation of strong interchain 
interactions between the poly- 
mer chains and the substrate by 
manipulating the environmental 
conditions. The average size and 
concentration of the polymer 
chains, the affinity for the sol- 
vent as well as for the substrate, 
and the interactions between 
polymer chains are all important 
parameters for achieving effec- 
tive interlocking. For example, 
a pH-sensitive natural polymer 
such as chitosan can diffuse into 
a biological tissue at low pH and 
form strong hydrogen bonds at 


708 


ultrasound. 
Penetration 
depth 


Epidermis 


Dermis 


Hydrogel bonding 

A hydrogel is applied to 
the tissue. The polymer 
chains interlock into 
the hydrogel, and 
hydrogen bonds form 
between the polymer 
chains. 


Debonding 
mechanism 

As the hydrogel is 
removed, the 
interlocked polymer 
chains anchored 

in the tissue are 
able to stretch and 
deform, which 
drastically increases 
the toughness of the 
adhesive joint by 
spreading the stress 
over a larger volume. 
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specific adhesives will need to 
match tissues in such a way as 
to actively promote repair by 
means of cell or drug delivery. 
In this way, tissue-specific bio- 
active adhesives can help the 
healing process by stimulating 
the patient’s body to use its own 
systems to recreate cells and re- 
build tissues and organs. 


REFERENCES AND NOTES 


1. Z.Maetal., Science 377,751 (2022). 

2. Y.J.Z.Young, P.B. Medawar, Lancet 
236, 126 (1940). 

3. C.Karamaoun, B. Sobac, B. Mauroy, 
A.Van Muylem, B. Haut, PLOS ONE 13, 
e0199319 (2018). 

4. C.Cui,W. Liu, Prog. Polym. Sci.116, 
101388 (2021). 

5. P.A.Leggat, D.R. Smith, U. Kedjarune, 
ANZ/J. Surg.77,209 (2007). 

6. C.Creton, M. Ciccotti, Rep. Prog. Phys. 
79, 046601 (2016). 

7. L.Leger,C. Creton, Philos. Trans. R. Soc. 

London Ser. A 366, 1425 (2008). 
. J.Lietal., Science 357,378 (2017). 
. J.Yang, R. Bai, Z. Suo, Adv. Mater. 30, 
e1800671 (2018). 
10. M.R.Prausnitz, R. Langer, Nat. 
Biotechnol. 26, 1261 (2008). 


ACKNOWLEDGMENTS 

Support for this work was provided by the 
European Research Council (European 
Union's Horizon 2020 research and innova- 
tion program, consolidator grant agreement 
no. 864982). 


10.1126/science.abq7021 


science.org SCIENCE 


GRAPHIC: V. ALTOUNIAN/SCIENCE 


PERFLUOROCARBONS 


Perfluorocubane—a tiny electron guzzler 


Perfluorination gives cubane the capacity to host an extra electron in its inner structure 


By Marie Pierre Krafft! and Jean G. Riess? 


igh symmetry arouses a sense of per- 
fection and perennity. The highly sym- 
metrical cubane molecule (C,H,), de- 
spite its improbable bonding angles, 
has been synthesized using a complex, 
multistep approach (7). Unexpectedly, 
the structurally strained and thermodynami- 
cally unstable molecule proved to be inert. In 
the molecular world, symmetry impedes re- 
activity because no weak spot is available for 
intrusion. Achieving perfluorination—that 
is, replacing all hydrogens of cubane with 
much larger fluorine atoms—was a chal- 
lenge, with the reward of creating a perfect 
nest for hosting an additional electron, thus 
generating a radical anion. On page 756 of 
this issue, Sugiyama et al. (2) report the syn- 
thesis of perfluorocubane, C,F,, contributing 
to the understanding of electron-accepting 
perfluorinated polyhedral structures. 

Since its inception (J), cubane has in- 
spired many investigations, derivatiza- 
tions, functionalizations, and sugges- 
tions for potential uses, particularly in 
medicine and materials science (3-9). 
Perfluorination drastically changes the 
structure and electron charge distribu- 
tion of cubane. Compared with hydrogen, 
fluorine—with its nine protons and nine 
electrons—is much larger and has a higher 
electron density, ionization potential, elec- 
tron affinity, and electronegativity as well 
as lower polarizability. Perfluorocarbons 
are among the most thermally stable and 
chemically inert materials ever synthe- 
sized. Perfluorination generates new prop- 
erties or enhances known ones to levels 
often unattainable otherwise (10). Specific 
features include combined hydrophobicity 
and lipophobicity, combined oxygen-carry- 
ing capacity and biocompatibility, extreme 
surface activity, and low-temperature re- 
silience (10). Cautious use is needed, how- 
ever, because perfluorochemicals can raise 
environmental problems, particularly as a 
result of persistence and accumulation in 
living organisms (1/7). Regarding perfluoro- 
cubane, the predicted property of netting 
an electron in a carbon polyhedron (12) 
offered a tantalizing additional challenge. 
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To synthesize C,F,, Sugiyama et al. selected 
direct fluorination—i.e., replacement of the 
hydrogens with fluorines using fluorine mol- 
ecules (F,). The reaction of F, with organic 
material is highly exothermic and provokes 
profuse carbon-carbon (C-C) bond _ break- 
ing. Taming such explosive reactions is not 
straightforward. Direct fluorination was orig- 
inally done by diluting F, in an inert gas, typi- 
cally nitrogen, at low temperature in a chlo- 
rofluorocarbon solvent (13). The technique 
was later improved by attaching a fluorinated 
ligand to the molecule that is to be perfluo- 
rinated (14). Heptafluorocubane (C,HF,) was 
thus obtained, and the remaining hydrogen 
was replaced by a flourine using standard 
monofluorination reactions to produce C,F,. 

Notably, the experiments of Sugiyama 
et al. showed little C-C bond length change 
upon perfluorination. This implies that the 
C-C bond shortening expected from highly 
electronegative fluorine substitution is coun- 


“Perfluorination causes 
sweeping steric and electronic 
alterations, entailing 
profound changes in properties 
and reactivity...” 


tered by the increased spatial needs of the 
fluorine atoms. Their data also revealed a 
rare directional noncovalent carbon bond 
between a fluorine atom and the nearest per- 
fluorocubane molecule. Most notably, their 
data confirmed the prediction that the over- 
lap (so-called sigma stellation) of all eight 
antibonding orbitals of the C-F bonds form 
an extended star-shaped molecular orbital. 
The result of this sigma stellation is a pro- 
nounced electron affinity and the possibility 
to host an electron inside the eight-carbon 
cubic frame, generating the radical anion 
C,F,~ (12). Distinctively, the spin density of 
the hosted electron is chiefly located inside 
the cubane frame, in contrast to the common 
electron acceptors that store extra electrons 
on their surface. The radical anion is, how- 
ever, unstable. 

Perfluorination causes sweeping steric 
and electronic alterations, entailing pro- 
found changes in properties and reactivity, 
making forecasts hazardous. Only further 


experimentation and modeling studies will 
tell what properties and reactivity perfluoro- 
cubane has. Future studies may investigate 
the synthesis of other functional fluorinated 
cubane derivatives, devising means of intro- 
ducing such building blocks into molecules 
and polymers and assessing their chemistry 
elicited by the extra electron hosted inside 
the structure. The rare carbon-bonding do- 
nor capacity could be explored using cyanide 
and similarly elongated anions. The use of 
functional perfluorocubane building blocks 
as linkers, spacers, or appendages in poly- 
mers may also be rewarding. 

The inspiring asset of perfluorocubane 
and derivatives lies in their distinctive elec- 
tron hosting capacity and redox activity. 
Perfluorocubane-derived compounds may 
offer an opportunity to experimentally de- 
termine what differentiates an internally 
netted electron from an externally located 
one. New means of grafting a small, space- 
filling electron-sequestering building block 
into molecules would be valuable for de- 
veloping redox-active, functional electron- 
carrying and -transferring materials, which 
could find applications in organic conduct- 
ing and semiconducting materials, magnetic 
frameworks, or photothermal materials. 
Currently, the availability and cost of func- 
tional perfluorocubane-derived reagents on 
an appropriate scale may impede extensive 
investigations. The findings of Sugiyama et 
al. also warrant the investigation of the per- 
fluorination of further polyhedral molecular 
structures—especially those susceptible to 
achieving sigma stellation—as well as explo- 
ration of their electron chemistry. 
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ACTIVE MATTER 


A soft active matter that can climb walls 


Mechanical activity of an active fluid can be used to control its dynamics at the boundaries 


By Jérémie Palacci 


f you mix fruit syrups with alcohol to 
make a schnapps, the two liquids will 
remain perfectly blended forever. But 
if you mix oil with vinegar to make a 
vinaigrette, the oil and vinegar will 
soon separate back into their previous 
selves. Such liquid-liquid phase separation 
is a thermodynamically driven phenom- 
enon and plays an important role in many 
biological processes (J). Although energy 
injection at the macroscale can reverse 
the phase separation—a strong shake is 
the normal response to a separated vinai- 


To investigate the phenomenon of liquid- 
liquid phase separation, the authors used 
a mixture of the polymer poly(ethylene 
glycol) (PEG) with dextran, a polysaccha- 
ride derived from glucose. Because both 
PEG and dextran are biocompatible, they 
can be combined with functional protein 
nanomotors—kinesins. PEG-dextran can 
phase separate into PEG-rich drops in a 
dextran-rich phase (3). In addition, PEG 
and dextran exhibit ultralow surface ten- 
sion, in which their phase boundaries are 
a thousand times flimsier than in an oil- 
water mixture and deform with minimal 
mechanical stress. 


A“super” fluid, but not a superfluid 


When experimenting with a polymer poly(ethylene glycol) (PEG)—dextran mixture with molecular motors and 


A “normal” liquid, 
such as water, rises 
up on the wall and 
forms a meniscus 
because of capillarity 
forces and the affinity 
of water for glass. 


Ww 


“Normal” liquid 


r 


A superfluid, such as 
liquid helium, can 
climb all the way up 
and over the wall, 

as surface tension 
pulls and the lack 

of dissipation allows 
the fluid to move. 


grette—little is known about the effect of 
energy added at the microscopic level on 
phase separation. This fundamental ques- 
tion has deep ramifications, notably in bi- 
ology, because active processes also make 
the interior of a living cell different from a 
dead one. On page 768 of this issue, Adkins 
et al. (2) examine how mechanical activity 
at the microscopic scale affects liquid-liq- 
uid phase separation and allows liquids to 
climb surfaces. 
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microtubules, Adkins et al. discovered a nonequilibrium wetti 


ng transition that allows the active fluid to climb walls. 


The PEG-dextran mixture 
climbs higher than a 


“normal” liquid, but only 

when there is sufficient flow 

inside the liquid, which can 
es 


be induced by the molecular 
motors and microtubules. 
As a result of this “activity,” 
a thick wetting layer can 
climb further than 
the equilibrium 
capillary rise, 
showing a 
wetting 
transition 
driven by 
activity. 


PEG-dextran 
mixture ee 


“4, 
ae Molecular 


motor 


Adkins et al. investigated how energy 
injected at the microscopic scale affects 
the phase separation between PEG and 
dextran. To do so, they made the fluid “ac- 
tive’—that is, one that contains individual 
constituents that exert forces—by adding 
kinesin molecular motors and microtu- 
bules to the fluid, a combination that has 
been done before (4). Powered by the ki- 
nesin molecular motors, the microtubules 
continuously slide past each other, lead- 
ing to chaotic flows inside the fluid. The 
strength of those flows characterizes the 
activity of the fluid. When the kinesin and 


microtubules are added to a PEG-dextran 
fluid undergoing phase separation, the ki- 
nesin motors and microtubules exclusively 
reside in the dextran phase, whereas the 
PEG phase remains passive. 

The authors observed the behavior of 
the passive drops of PEG immersed in 
the active fluid powered by kinesins mo- 
tors. At moderate activity of the fluid, the 
growth of the PEG droplets during phase 
separation appeared faster than in a pas- 
sive system. The activity in the dextran 
phase enhanced the movement of the PEG 
droplets and favored the droplets to col- 
lide and merge with each other. In effect, 
the system behaved as a hot thermal bath. 
Unexpectedly, at high activity, the PEG 
droplets stopped growing, and the phase 
separation arrested. At this point, the fluid 
mixture reached a steady state, in which 
larger droplets shed off smaller droplets at 
the same rate that smaller droplets merged 
to form large droplets. The authors posit 
that deformation of the boundary between 
the two phases is caused by active stresses 
of the fluid competing with the surface ten- 
sion between the two phases of PEG and 
dextran. This competition can be observed 
as interfacial waves travel along the PEG- 
dextran boundary, and the interface shows 
giant fluctuations in shape. This model is 
consistent with simulations based on a hy- 
drodynamic model, which agree with the 
observations and furthermore allow the es- 
timation of active stresses, a fundamental 
but elusive parameter of the mechanics of 
active systems. 

On the basis of their results, the authors 
then provide an example of how active 
stresses of liquids may be leveraged to cre- 
ate a “super” fluid, in which internal forces 
allow the active fluids to climb against 
gravity—almost like a superfluid but with 
broadly different physical reasons (see the 
figure). Such a liquid can be used as an 
experimental platform for active, shape- 
shifting soft materials. 

Another notable feature of this study is 
the connection established between ther- 
modynamics and active matter. Active mat- 
ter is a research area that focuses on the 
emerging and dynamic properties of a large 
number of active agents, such as a flock 
of starlings or a colony of bacteria. These 
systems can be found across all length and 
time scales in nature. The field has expe- 
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rienced a surge in the past decade, with 
important experimental progress both on 
the synthetic (5-8) and biology side (9, 10) 
and conceptual breakthroughs highlighting 
flocking behavior (8, 17), nonequilibrium 
phase separation (12), and the role of topol- 
ogy and topological defects in controlling 
the dynamics of active systems (73-15). 

The work by Adkins et al. opens a new 
horizon for active matter, in which an ac- 
tive phase can exert forces and generate 
flows to control the behavior of a passive 
system. Active matter constitutes a potent 
toolbox to devise new phases of matter, le- 
veraging the interplay of an active phase 
onto a “passive one.” The study of these 
hybrid materials addresses fundamental 
questions, such as the role of activity to 
control the phase diagram or to promote 
phases of matter that have no conventional 
equivalent. The effect of the activity on the 
thermodynamics of liquid-liquid phase 
separation also remains an open question. 


“This work... will be inspirational 
for others to investigate the 
rich interplay between active 
and passive phases...” 


Active stresses can couple to passive con- 
stituents and control their organization. 
Its relevance in biology as well as the de- 
sign of adaptable soft materials and ma- 
chines remains to be explored. A consider- 
able challenge is to establish experimental 
platforms to devise and study those phases 
of matter. This work is an important step 
in that direction. It will be inspirational 
for others to investigate the rich interplay 
between active and passive phases—an un- 
charted territory. 
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SYNTHETIC BIOLOGY 


Synthetic gene circuits take root 


Complex spatial patterns of gene expression are 
engineered in plants to modulate root morphology 


By Simon Alamos! and Patrick M. Shih*234 


long-standing aim of synthetic biology 

has been to engineer genetic circuits 

that are able to confer prescribed spa- 

tiotemporal patterns of gene expres- 

sion. This feat would not only unlock 

the next tier in the technology tree of 
translational biology but would also consti- 
tute a powerful demonstration of a predic- 
tive and quantitative basic understanding of 
genetic regulation in higher eukaryotes. On 
page 747 of this issue, Brophy et al. (1) re- 
port a strategy to engineer spatial transcrip- 
tional patterns in the root of the model plant 
Arabidopsis thaliana to alter its morphology. 
This effort constitutes a milestone in the ge- 
netic engineering of a whole, fully developed 
multicellular organism and points to the 
challenges ahead. 

Early studies of the cis and trans regula- 
tion of plant and animal genes showed that 
spatiotemporal patterns of transcription fol- 
low a combinatorial logic (2, 3). Whether a 
gene is expressed in a cell type depends on 
the combination of DNA binding sites in its 
regulatory region and whether these sites 
are occupied by activator or repressor tran- 
scription factors (TFs) present in these cells. 
These regulatory combinations can be de- 
scribed as boolean logical operations. For ex- 
ample, a gene may be transcribed only if its 
activator is present while its repressor is ab- 
sent, a logic operation known as a “NIMPLY 
gate.” It also became clear that TF proteins 
themselves are modular, with DNA binding 
and regulatory activity being physically and 
functionally separated (4). Together, these 
findings inspired what might be considered 
the earliest synthetic biology tools (5). It 
became possible to fuse heterologous DNA 
binding domains to transcription activation 
domains and express these synthetic TFs in 
specific cell types. Genes under the control 
of synthetic promoters that bear binding 
sites to these synthetic TFs could thus be 
expressed in specific cells and/or under de- 
fined conditions (5). 
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Although these tools proved invaluable 
for basic biology, the spatiotemporal control 
of gene expression in whole organisms re- 
mained modest. Plant genetic engineering 
has barely explored the combinatorial logic 
of cis and trans elements that underpin 
plant developmental complexity. Synthetic 
efforts in the Drosophila melanogaster em- 
bryo—a prime model for animal genetic cir- 
cuit interrogation—have so far been limited 
to simple activation, a long way from the 
choreography of repressors and activators 
that establish the fly body plan (6). Thus, 
the repertoire of transcriptional logical op- 
erations engineered in complex eukaryotes 
is still limited, in contrast to microbes and 
cultured cells (7). Indeed, there persists a 
humbling gap between our understand- 
ing of the endogenous genetic circuitry 
that controls gene expression patterns in 
animals and plants and our ability (or lack 
thereof) to engineer these patterns (8). 

These limitations stem in part from a 
dearth of “genetic parts” that can perform 
logical operations in whole organisms. This 
is not entirely unexpected because engineer- 
ing synthetic TFs and the synthetic promot- 
ers responsive to them can take multiple it- 
erations of testing and optimization, whereas 
the turnaround time to generate a transgenic 
plant or animal is several months. To close 
this gap, Brophy ez al. turned to a model sys- 
tem that is somewhat intermediate between 
cell culture and a transgenic plant. It is possi- 
ble to transiently express multiple transgenes 
in the leaves of the tobacco relative Nicotiana 
benthamiana and measure the circuit per- 
formance using green fluorescent protein 
(GFP) fluorescence in just 2 days (see the 
figure). Using this platform, they developed 
a library of synthetic TFs based on bacterial 
DNA binding domains and eukaryotic activa- 
tion and repression domains. Combined with 
synthetic promoters that bear bacterial DNA 
binding sites, these form an impressive col- 
lection of parts that can perform all major 
logical operations. 

Given the completeness of this toolbox, it 
should, in principle, be possible to use cell 
type-specific promoters to drive synthetic 
TFs and create spatially embedded logical 
operations in a cell type-specific manner. 
Of all multicellular model systems, the A. 
thaliana root has emerged as one of the 
premier systems for this type of experiment 
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owing to its relative simplicity and acces- 
sibility for imaging. Moreover, cell type- 
specific messenger RNA (mRNA) patterns 
in the root have been thoroughly charac- 
terized and have resulted in a collection of 
tissue-specific promoters. In contrast to the 
findings of Brophy et al., which were ob- 
tained using transient expression in N. ben- 
thamiana, synthetic circuits designed to 
perform spatial logical operations in stably 
transformed A. thaliana tended to deviate 
from their expected behavior. For example, 
it was found that the activity of some syn- 
thetic TFs varied greatly across cell types 
depending on the promoter under which 
they were expressed. The authors carefully 
documented these negative findings and 
proposed concrete strategies to overcome 
them. The field would benefit greatly from 
such transparency. Chief among the debug- 
ging strategies was simply swapping syn- 
thetic cis and trans elements, which high- 


behaviors that are only beginning to be ap- 
preciated, much less understood. It is likely 
that a deeper basic understanding of multi- 
cellularity and development will be necessary 
to leverage synthetic biology for translational 
applications in plants and animals. 

As the term suggests, translational re- 
search depends on prior basic biology. 
Translating implies a sequential hierarchy: 
discover first, engineer second. Synthetic 
biology flips this script, arguing that this 
sequential hierarchy can be productively in- 
verted (9). This “build to understand” ethos 
could be the path forward. Synthetic biology 
approaches tailored to embrace the com- 
plexity of developing organisms are well-po- 
sitioned to provide sophisticated and quan- 
titative tools to dissect the inner workings 
of genetic networks specifically in an organ- 
ismal context. Discovering the intricacies of 
genetic networks in whole organisms would 
in turn enable more advanced synthetic bi- 


Synthetic 
transcriptional 
logic gates 

Logic gates such 

as NIMPLY can be 
implemented at the 
transcriptional level 

to engineer biological 
computation. Brophy 

et al. developed a library 
of different synthetic 
transcription factors 
and promoters. These 
tools were optimized 
and then used to drive 
root development 

in Arabidopsis thaliana. 
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Rapid engineering of 
transcriptional logic gates 


Expressing synthetic transcription factors 
and promoters in Nicotiana benthamiana 


allows rapid design- 


GFP, green fluorescent protein 


lights the importance of a diverse toolbox. 
These results underscore that those genetic 
tools that perform well at a lower level of 
biological complexity are not guaranteed 
to work in a more complex context. This is 
a cautionary tale for those in the broader 
synthetic biology community who are seek- 
ing to engineer multicellular organisms. 
Models such as mammalian cell culture 
or transient expression in N. benthami- 
ana allow for a speed and throughput that 
is unthinkable in whole organisms. These 
simpler systems have been pivotal in the 
development of molecular toolboxes for 
bioengineers such as the one introduced by 
Brophy et al. However, implementing these 
tools in a transgenic animal or plant line is 
far from trivial. Why this should be the case 
is an interesting question that the field will 
have to grapple with in the future. Cells in 
intact tissues give rise to emergent collective 
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Spatial synthetic circuits 
in whole organisms 
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ology applications, creating a virtuous cycle. 
Moving forward, the A. thaliana root could 
serve as an ideal model to better understand 
the principles of natural and engineered 
gene circuits in an organismal context. The 
lessons learned from Brophy et al. are key 
to the future success of implementing and 
translating combinatorial circuits not just 
into plants but more broadly into other 
complex biological systems. 


build-test cycles. 


REFERENCES AND NOTES 


. J.A.N. Brophy et al., Science 377, 747 (2022). 

. D.Stanojevié et al., Nature 341, 331 (1989). 

. P.N. Benfey, N.-H. Chua, Science 250, 959 (1990). 

. L.Keeganetal., Science 231, 699 (1986). 

. J. Haseloff, Methods Cell Biol.58, 139 (1999). 

. J. Crocker et al., Cell Rep.18, 287 (2017). 

. S.Auslander, M. Fussenegger, Cold Spring Harb. 
Perspect. Biol. 8,a023895 (2016). 

. B.J.Vincentetal., Integr. Biol. 8,475 (2016). 

M. Elowitz, W.A. Lim, Nature 468, 889 (2010). 


NMORWNHE 


Rome) 


10.1126/science.add6805 


MOLECULAR BIOLOGY 


Activating 
translation 
with phase 
separation 


Ribonucleoprotein granules 
allow activation of 
translation to complete 
mouse spermatogenesis 


By Anne Ramat and Martine Simonelig 


hase separation allows cellular or- 

ganization through the coordination 

of biochemical reactions into mem- 

braneless biomolecular condensates 

(1). Studies have focused on the bio- 

physical properties and dynamics 
of biomolecular condensates. Now, a key 
issue to address is the biological functions 
of these condensates: Can phase separa- 
tion induce specific functions and, if so, 
how (2, 3)? In the cytoplasm, messenger 
RNAs (mRNAs) and RNA binding proteins 
assemble into biomolecular condensates 
called ribonucleoprotein (RNP) granules 
that are intimately linked to RNA biology. 
RNP granules have been associated with 
mRNA storage and translational repression. 
On page 727 of this issue, Kang et al. (4) re- 
port translational activation as a previously 
unknown function of biomolecular conden- 
sates, showing that phase separation of the 
RNA binding protein FXR1 (fragile X men- 
tal retardation syndrome-related protein 1) 
is required for completion of spermatogen- 
esis in mice. 

Since the pioneering study that applied a 
concept well known in polymer chemistry to 
understand the dynamics of Caenorhabditis 
elegans germline P granules (5), liquid-liquid 
phase separation has become a flourishing 
topic in cell biology. Phase separation occurs 
when molecules locally reach a concentra- 
tion threshold at which they demix from the 
surrounding milieu, leading to the formation 
of biomolecular condensates. Demixing de- 
pends on a multitude of multivalent inter- 
actions within and between nucleic acids 
and proteins that contain intrinsically dis- 
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ordered regions, low-complexity 
domains, or multiple interaction 
domains (6). 

Cytoplasmic RNP granules 
have been linked to mRNA stor- 
age and translational repression. 
This is the case for stress gran- 
ules that form upon environ- 
mental stress and store mRNAs 
in a translationally repressed 
state until stress is alleviated (7). 
Germ granules also store mater- 
nal mRNAs in a repressed state 
for prolonged periods of time wore 
(8). Germ granules store mRNAs 
that drive germ cell develop- 
ment. Therefore, an outstanding 
question is how translation of 
these mRNAs is activated. A re- 
cent study in C. elegans showed 
that translation takes place after 
the exit of mRNAs from germ 
granules (9). However, other 
studies hinted at a link between 
germ granules and translation 
activation. In both mice and 
Drosophila melanogaster, the 
PIWI proteins (core components mRNA 
of germ granules) were shown 
to activate translation by recruiting transla- 
tion initiation factors to germ granules (JO, 
11). Moreover, using a technique that records 
translation in living cells, it was shown that 
mRNAs can be actively translated in stress 
granules (72). Thus, translation can occur in 
RNP granules, but our understanding of this 
process is limited. 

Spermatogenesis in mammals is divided 
into several steps, the last one being sper- 
miogenesis. During spermiogenesis, hap- 
loid spermatids are transformed into sper- 
matozoa through a series of morphologic 
changes. Transcription gradually stops 
during this process; therefore, spermio- 
genic mRNAs that encode factors involved 
in spermatid differentiation are produced 
earlier and stored in a translationally re- 
pressed state until their timely translation. 
The molecular mechanisms underlying this 
switch from translational repression to acti- 
vation have remained a mystery. 

Analyzing the proteome associated with 
spermiogenic mRNAs that undergo active 
translation, Kang et al. identified FXR1. 
Phase separation of the related protein, frag- 
ile X mental retardation protein (FMRP), 
has been observed using in vitro systems 
and linked to translational repression (3, 
14). Kang et al. found that FXR1 binds ac- 
tively translated spermiogenic mRNAs, and 
Fxrl germ cell-specific deletion revealed 
that FXR1-mediated translational activation 
is important for late spermatid differentia- 
tion and mouse fertility. Prompted by the 


— 


SCIENCE science.org 


Translation repression 


Spermiogenic 


A molecular function of phase separation 
Cytoplasmic ribonucleoprotein (RNP) granules are classically associated 

with translational repression. But, in mouse spermatids, phase separation of 
FXR1 (fragile X mental retardation syndrome-related protein 1) is required 

for translational activation of spermiogenic messenger RNAs (mRNAs). FXR1 
associates with the translation factors EIF4G3 (eukaryotic translation initiation 
factor 4y3) and PABPC1 [poly(A) binding protein C1] and recruits them to 
granules in late spermatids. 
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presence of intrinsically disordered regions 
in FXR1, as in FMRP, the authors found that 
FXRI1 levels increase in late spermatids when 
spermiogenic mRNAs are translated, corre- 
lating with the formation of foci, which the 
authors established were phase-separated 
condensates (see the figure). 

Kang et al. managed to identify a point 
mutation in one of the KH RNA bind- 
ing domains of FXR1 that prevents FXR1 
phase separation while preserving its RNA 
binding capacities. Using this mutant in 
cultured cells and in mouse testes, they 


“.ulinderstanding how granule 
formation contributes 

to biological functions in vivo 

remains a major challenge.” 


showed that translational activation de- 
pended on the capacity of FXR1 to form 
RNP granules. Moreover, they restored the 
ability of mutated FXR1 to form granules 
by inserting an additional intrinsically 
disordered region from a heterologous 
protein, thus increasing its potential for 
multiple interactions. This additional re- 
gion rescued both the capacity of FXR1 to 
form granules and its function in activat- 
ing translation of spermiogenic mRNAs, 
leading to spermatid differentiation into 


is Ca 


Translation 


functional spermatozoa. Furthermore, in- 


troducing the FXR1 point mu- 
tation in mouse germ cells led 
to male sterility, phenocopy- 
ing Ferri germ cell deletion. 
These results clearly establish 
the critical role of FXR1 phase 
separation for translational ac- 
tivation and spermatogenesis 
in mice. 

Although RNP granule dy- 
namics during developmental 
processes have been described 
ss (15), understanding how gran- 

be ule formation contributes to 
biological functions in vivo re- 
mains a major challenge. This 
study represents an important 
advance in addressing this is- 
sue. However, many aspects re- 
main unresolved. It is unknown 
how FXRI1 phase _ separation 
activates translation and where 
in the granule translation takes 
place. The length of mRNAs, 
rather than the presence of mo- 
@ tifs, seems to determine mRNA 
AAAA inclusion in FXR1 granules as 
well as other RNP granules. This 
raises the question of how speci- 
ficity is achieved. Additionally, phase separa- 
tion of FXR family members induces transla- 
tional repression or activation. It is unclear 
whether the same protein orchestrates these 
opposite functions and through which mo- 
lecular determinants. Many RNP granules 
are not homogeneous, but rather are made 
of immiscible phases. Understanding how 
these different phases participate in differ- 
ent functions of RNP granules represents 
another important challenge. 
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ENVIRONMENTAL ECONOMICS 


Insurance-sector tools 
to combat biodiversity loss 


Risk transfer can facilitate nature-positive investments 


By Carolyn Kousky 


e are in the middle of a global mass 

extinction event (7). In addition to 

heightened risks related to food se- 

curity, disease, natural disasters, air 

and water quality, and nonmaterial 

aspects of life (7), loss of biodiver- 
sity also threatens the livelihoods of billions 
of people and creates risks to the economy. 
The World Economic Forum estimated that 
over half of global gross domestic product 
($44 trillion) is highly or moderately depen- 
dent on nature (2). Some of these risks are 
then transmitted to financial and insurance 
markets (3). Addressing all these risks has 
been articulated in the “nature-positive” goal 
of halting and reversing nature loss globally 
by 2030. Achieving this goal will require pro- 
found changes in the relationship between 
nature, economies, and social and governing 
institutions. As leading risk managers, the in- 
surance sector, which is not typically engaged 
widely on nature issues, has multiple tools to 
help mainstream the transition. 

Engagement of the insurance sector 
would generate widespread global benefits, 
but also lower the risks insurers face from 
biodiversity loss, including risks of higher 
claims payments, lower investment returns, 
higher operating costs, reputation impacts, 
and/or increased regulatory and transition 
risks (4). Below, I describe five primary path- 
ways that insurers can pursue to support na- 
ture-positive goals: creating new risk-transfer 
approaches, incorporating impacts on nature 
into underwriting, pricing for the risk reduc- 
tion benefits provided by natural systems, ap- 
plying a nature-positive lens to investments, 
and insuring nature itself (5). Despite myriad 
reports and papers suggesting possible ap- 
proaches for insurers to have an impact in 
this area, the number of pilots or programs 
that have actually been implemented is much 
smaller and is the focus of this discussion. 
Not discussed here, given the many other 
treatments of corporate environmental stew- 
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ardship—but equally important—is insurers 
improving the environmental practices of 
their own operations. 


NEW RISK-TRANSFER APPROACHES 
Conservation and restoration projects need 
a variety of insurance coverages, including 
some familiar ones, such as liability and 
worker’s compensation, as well as new prod- 
ucts. Many conservation groups, however, 
have found that insurers tend to be unfamil- 
iar with their needs and specific risk profile. 
This has made it difficult for them to find af- 
fordable insurance or, sometimes, any cover- 
age at all. In response, new organizations and 
products are emerging that unite expertise in 
conservation with expertise in risk transfer. 

One example is Conservation United, a 
brokerage firm dedicated to finding tailored 
insurance solutions for conservation groups. 
Founded by professionals from both con- 
servation and the insurance industry, the 
brokerage offers standard policies, includ- 
ing workers compensation, liability, auto, 
accident, umbrella policies, and also pollu- 
tion insurance, but tailored to the actual risk 
profile of conservation groups. By educating 
insurers about the specific risks, as well as 
the risk reduction measures that conserva- 
tion groups can adopt, Conservation United 
has often found more affordable policies for 
these groups. 

Conservation efforts may also require 
new products to meet specific needs, such 
as insurance and performance bonds to 
assure the achievement of ecological per- 
formance standards for wetland mitigation 
banks. In the US, when impacts to wet- 
lands trigger Clean Water Act regulations, 
they must be mitigated by restoring other 
wetlands and preserving them in perpe- 
tuity. To receive the necessary permits, 
insurance, performance bonds, or other 
financial assurance must be provided to 
guarantee sufficient funds for comple- 
tion of the wetland mitigation activity. 
Conservation United cofounder, Chris 
Baker, has developed such a product where 
failure to achieve a defined ecological per- 
formance standard for a wetland triggers 


a claim, with the payout used to bring the 
wetland into compliance. 

Sometimes, the risk transfer solution for 
conservation may lie outside the traditional 
insurance market. Consider land trusts in the 
US, non-governmental conservation groups 
that protect land using legal tools such as 
conservation easements. They may have 
to defend conservation lands against legal 
challenges, which can be expensive and for 
which they need insurance coverage. Several 
years ago, land trusts were struggling to find 
this coverage in the private market. Insurers 
were reluctant to take on this risk because 
the potential market was small and there 
was not much historical data on the size of 
the risk (and if insurers did take it on, they 
would charge a high premium). In response, 
the Land Trust Alliance created Terrafirma 
Risk Retention Group LLC. It has an un- 
usual organizational structure, being both a 
nonprofit organization and a captive, which 
is an insurance provider that is owned by 
the members to whom it provides coverage. 
Captives are a useful solution for risks that 
are difficult or too pricey to find coverage for 
in the market and have been used by many 
sectors. Being owned by members, they may 
also be better able to support risk reduction 
and risk education. This structure allows 
Terrafirma to provide the insurance more ef- 
ficiently than the private sector, because they 
do not take a profit and can reinvest premi- 
ums not needed to pay claims. As of the time 
of writing in 2022, Terrafirma offered conser- 
vation defense liability insurance policies to 
545 land trusts covering 10 million acres of 
conservation land. 

Additional new risk-transfer products are 
also emerging to support nature-positive 
activities. One example is human-wildlife 
conflict insurance. A review in 2017 found 
that since 1980, programs in 50 different 
countries have been established to make pay- 
ments to farmers or ranchers if wildlife from 
conservation areas destroy crops or livestock 
(6). The aim is to reduce retaliatory killings 
of wildlife. There has been increasing inter- 
est in harnessing insurance as a solution to 
human-wildlife conflict as it might speed 
payouts and improve operations. An ongoing 
challenge, however, is designing a product 
that beneficiaries are willing to pay for and 
can afford. To solve this, some approaches 
have channeled funding from government 
or wildlife-related tourism toward premiums 
(7). Indeed, public-sector support for insur- 
ance is often necessary for products aimed at 
lower-income populations. 


INCORPORATE NATURE INTO 
UNDERWRITING 

Underwriting is the process of evaluating 
risks and deciding whether to insure them. 
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Typically, underwriting decisions are driven 
by risk assessments. Explicitly accounting for 
risks stemming from degradation of nature 
might highlight cases where firms engaged 
in harmful environmental practices also 
have higher risks, such as liability risk, sup- 
ply chain disruption risk, or reputational risk 
(8), making the firms poorer risks to assume. 
Some limited steps have been taken in this 
direction, such as several insurers pulling 
back insurance for vessels engaged in illegal 
fishing (9). Greater research on when, and 
the extent to which, nature-positive activities 
lower risks could guide the industry further. 

Insurers could also restrict underwrit- 
ing for certain entities to support broader 
nature-positive goals, even if insuring those 
entities would not be a higher risk for the 
insurer. Recently, advocates have pushed 
this position, arguing that in- 
surers contribute to solving the 
climate crisis by refusing to in- 
sure carbon-intensive firms or 
facilities. In response, some in- 
surers are pledging to align their 
underwriting with a net-zero 
emissions goal by 2050 through 
the United Nations-convened 
Net-Zero Insurance Alliance, 
which claims that such pledges 
currently represent about 11% of 
global premium written. A simi- 
lar approach could be extended 
to risks associated with actions 
that harm biodiversity and spe- 
cies’ populations. 

Beyond refusing to insure 
firms that contribute to degrada- 
tion of nature, insurers could de- 
mand substantial modifications 
to their operations to lessen 
their environmental impact. 
For example, AXA has pledged 
to stop underwriting firms that 
contribute to biodiversity loss in 
key sectors, such as production of palm oil, 
soy, cattle, and timber, but also to work with 
others in these supply chains to adopt more 
sustainable approaches (J0). 


PRICING 

Some natural systems reduce disaster risks 
to property. Wetlands can store floodwaters, 
ecological forestry practices can reduce wild- 
fire risk, mangroves can buffer storm surge, 
green infrastructure can absorb storm water, 
and vegetation can stabilize slopes. Because 
the price of insurance reflects the underlying 
risk, when insured property is protected by 
these ecosystems, it should lower the cost of 
disaster insurance. Although there is hope 
that such reductions in insurance prices 
could incentivize greater investments in con- 
servation and restoration, it is not so easy. 
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First, a lower insurance price only incentiv- 
izes investments in risk reduction for those 
who are actually insured—and there are 
many entities not insured against disasters. 
Second, to lower prices due to protection 
from natural systems, insurance firms need 
to be able to quantify how much the con- 
servation will affect the likelihood or mag- 
nitude of claims they would have to pay on 
insured property. Modeling the benefits of 
nature at this scale is relatively new and typi- 
cally highly uncertain. Still, progress is be- 
ing made. A recent analysis by Willis Towers 
Watson (WTW) and The Nature Conservancy 
(TNC), for example, used catastrophe models 
(the industry standard for simulation-based 
modeling of the probability and impact of 
disasters) to estimate premium reductions of 
10 to 40% when protected by landscape-scale 


A first-of-its-kind insurance policy will support repair of damages inflicted 
by Hurricane Delta in October 2020 on portions of the Mesoamerican Reef, 
shown here near Isla Mujeres, Mexico, in September 2018. 


ecological forestry approaches (11). AXA XL 
and the Ocean Risk and Resilience Action 
Alliance are developing a coastal risk index 
to quantify the impact of mangroves on flood 
risk; one use case they envision is helping in- 
surers price the value of natural systems. 

The final challenge is that natural systems 
tend to provide protection to many proper- 
ties, and the premium reductions on just one 
are not enough to offset the costs of expanded 
conservation. Indeed, the WIW-TNC report 
on ecological forestry found that premium 
reductions would be sufficient to finance the 
debt service on a bond to undertake the res- 
toration work—but only when accounting for 
the many property owners that would ben- 
efit. Even for large insured parties, like a tim- 
ber company or electric utility, the economics 
did not work on their own. 


That is a problem that plagues all invest- 
ments in conservation and _ restoration. 
Natural systems are public goods, and every- 
one has an incentive to free ride and let oth- 
ers pay to provide the public good. So, how 
can we develop an institutional structure 
that can overcome free-riding and harness 
the reduced insurance premiums from doz- 
ens to thousands of individual policyholders 
and use that as a financial flow to pay for eco- 
logical investments? 

There are some innovative options to har- 
ness premium reductions as a financing flow, 
none of which are yet in operation. One is to 
consider insurance at the level of the commu- 
nity instead of the individual. Such commu- 
nity-based insurance could provide financial 
protection for all residents against disasters 
(helping close the global disaster insurance 
gap), and the insurance savings 
from community-level invest- 
ments in nature-based protec- 
tion could financially justify 
greater investments in nature 
(12). A study examining a levee 
setback project (which provides 
ecological benefits by expand- 
ing the natural floodplain) found 
that such an investment could be 
financed by premium reductions 
if the community was purchas- 
ing a community-wide insurance 
policy (3). Community insur- 
ance, though, would need to be 
paid for through taxes or general 
revenue. No community has yet 
been willing to tax households or 
use other funds to purchase this 
type of community insurance. 

Without a single policy, it is 
trickier to use insurance savings 
to pay for conservation or resto- 
ration. One option is for a local 
government or special-purpose 
district to enter an agreement 
with insurers about the premium reductions 
they would offer to all residents if a conserva- 
tion or restoration project was undertaken. 
The government could then finance that 
investment with a bond and assess fees on 
residents to pay the debt service, knowing 
that the residents would be financially whole 
given their lower insurance costs. This faces 
political and institutional challenges more 
than financial ones. 


INVESTING FOR NATURE 

Insurers are large holders of capital, with 
trillions of dollars under management glob- 
ally. Much of this must be held in lower-risk 
assets to ensure future claims-paying abil- 
ity. But insurers still have some flexibility 
in how they allocate their investments. 
And like other firms, many insurance and 
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reinsurance companies are applying an en- 
vironmental, social, and governance (ESG) 
lens to their investment portfolio. This can 
involve refusing to invest in certain compa- 
nies, such as those with substantial carbon 
emissions; proactively investing in firms 
that are helping solve environmental prob- 
lems; and actively engaging with companies 
on their environmental practices. Although 
this type of investing has advanced around 
climate, nature-positive investing is still in 
its infancy. Identification of risks related to 
biodiversity loss and environmental deg- 
radation, as well as measurement and dis- 
closure, is still limited, but growing. The 
new Taskforce on Nature-related Financial 
Disclosures is developing a framework to 
mirror climate disclosures. Smaller groups 
guided toward certain issues, such as the 
Plastic Solutions Investor Alliance, are 
also forming and create opportunities for 
insurer engagement. Only an estimated 
14% of global investments in conservation 
and restoration come from the private sec- 
tor; this must at least triple by 2030 to meet 
global climate and biodiversity targets (14). 
Insurers, with their large holdings of capital, 
could help lead and develop best practices in 
this area. For example, AXA has created an 
impact fund that supports efforts like sus- 
tainable agroforestry and cocoa production. 

Some insurers are finding that companies 
with negative environmental impacts create 
other risks that make them poor financial 
investments, such as liability risks, risk of 
losing market share, or risks of losing their 
license to operate. This can strengthen the ra- 
tionale for focusing on investments that are 
nature-positive. Insurers may also find ad- 
ditional motivation for impact investments 
that support conservation or restoration of 
natural systems that lower disaster risks, 
such as wetlands or green infrastructure that 
lowers flood risk. When insurers are pro- 
viding property coverage for disasters, they 
could then directly benefit from such invest- 
ments in natural systems. Recognizing this, 
a group of Canadian insurers have recently 
joined Ducks Unlimited and provided fund- 
ing to a new partnership, Nature Force, that 
will identify investments in natural systems 
to manage flood risk in urban-adjacent areas 
of British Columbia, Ontario, and Quebec. 


INSURING NATURE 

Natural systems can be damaged from 
extreme events and need to be repaired. 
Consider coral reefs. Very severe hurricanes 
can break off coral and make it hard for coral 
to regrow. If teams of divers enter the water 
quickly after a storm, however, they can re- 
attach coral and help repair the reef. This has 
been done, with the help of insurance, to pro- 
tect a portion of the Mesoamerican Reef. 
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Multiple groups in Quintana Roo, Mexico, 
that care about the health of the reef formed 
the Coastal Zone Management Trust. In 
2019 (renewed in 2020 and 2021), the Trust 
achieved a first: It purchased an insurance 
policy on the reef itself to secure funding im- 
mediately after a storm to pay for divers to 
reattach coral. This was a parametric policy, 
meaning that payment was not based on 
damages but on observable measures of the 
hazard itself. Parametric insurance can be 
useful, as in this case, when it would be dif- 
ficult for the insurance company to estimate 
a dollar value of the actual damage. In the fall 
of 2020, when Hurricane Delta hit, the policy 
paid around $800,000 to the Trust for resto- 
ration work on the reef (see photo). 

Although the pilot is now expand- 
ing the concept to other sections of the 
Mesoamerican Reef, the approach has not 
yet been widely replicated in other systems, 
partly because there are limited cases where 
this is a useful model and where the institu- 
tional challenges in implementation, such 
as convincing many beneficiaries to jointly 
pay for insurance, have been solved (15). For 
many natural systems, after a natural disas- 
ter, it is best to leave the ecosystem alone to 
heal itself. So, the first criteria for replicabil- 
ity of this concept is identifying where res- 
toration intervention is needed in response 
to some type of disaster. Second, insurance 
is not always cost-effective. There are many 
times where it makes more financial sense 
to self-insure—that is, set money aside in a 
savings account to use immediately when 
needed, rather than pay annual premiums 
for an insurance policy. Third, there must be 
an institution to finance the insurance that 
spans the users and can overcome free-rid- 
ing. Finally, many serious threats facing eco- 
systems now are uninsurable because threats 
are chronic and/or there is not a clear resto- 
ration or repair activity to be financed. Coral 
reefs are threatened more by warming and 
acidifying ocean waters from climate change 
than by hurricanes, for example, but insur- 
ance cannot address those threats. 

Although those criteria narrow the pos- 
sible applications, this is still an innovative 
approach that could prove useful in the 
right circumstances—for example, insuring 
culturally important natural assets or the 
ecosystems in popular tourist locations to 
help quickly reestablish cultural and recre- 
ational values. Another possible application 
is to insure green infrastructure for its perfor- 
mance, or against possible setbacks in initial 
construction. 


MOVING BEYOND IDEAS 

Global coordination on the insurance in- 
dustrys role in broader environmental 
topics is emerging, such as through the 


United Nations Environment Programme 
Finance Initiative’s Principles for Sustainable 
Insurance. There are many theoretical dis- 
cussions on possible roles for insurers to con- 
tribute to building a nature-positive future, 
but far fewer implemented pilots or projects. 
Moving beyond ideas will require better edu- 
cation about risk transfer among those in the 
conservation field, as well as more time spent 
understanding the needs of conservation 
and restoration practitioners on the part of 
insurers. It will require insurance regulators 
to support nature-positive innovations and 
practices and the public sector to partner 
and potentially cost share on some efforts. 
It will also require insurance-sector firms to 
make the decision to proactively lead and 
make mission-driven decisions with respect 
to underwriting and investments. Finally, 
it will require deeper research on the risks 
associated with environmental activities, 
an exploration of current barriers to imple- 
mentation, and a sharing of best practices to 
move from theory to practice. 
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Our rhythmic existence 


A neuroscientist interrogates sleep and circadian clocks 


By Manon Torres and Achim Kramer 


id you wake up with an alarm clock 

this morning or work during the 

night last night? If the answer to 

either question is “yes,” then you 

have been doing yourself a disser- 

vice by ignoring your internal clock. 
Current science indicates that our bodies’ 
natural rhythms are essential for health, 
well-being, and productivity. So why do so 
many of us ignore or attempt to overcome 
them? In his new book, Life Time, neuro- 
scientist Russell Foster takes the reader on 
a vivid and knowledgeable quest to under- 
stand sleep and biological rhythms that is 
part scientific treatise, part self-help guide, 
and part plea for societal and personal life- 
style changes that better prioritize these 
biological functions. 

Humans evolved in a rhythmic environ- 
ment resulting from the rotation of Earth 
on its axis. We have therefore, like most 
species, developed internal or circadian 
clocks (from the Latin circa diem, meaning 
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“about a day”) to anticipate the daily recur- 
ring changes and challenges in our environ- 
ment. These clocks are molecular oscillators 
present in every cell of our bodies and or- 
chestrated by a master clock in the supra- 
chiasmatic nucleus of the hypothalamus, a 
structure located in the brain. 

The science of circadian clocks has ex- 
ploded in recent decades, impressively 
demonstrating that they are ge- 
netically encoded, varying from 
person to person, and that the in- 
terplay of the clock with an organ- 
ism’s behavior, environment, and 
age profoundly influences every 
aspect of the organism’s physiol- 
ogy. Foster’s own research was 
instrumental in demonstrating 
that light synchronizes our inter- 
nal clocks with our environment, 
by means of special blue light- 
sensitive receptors in the retina. 

Circadian clocks not only govern our 
sleep but also exert an extremely sophis- 
ticated temporal control over nearly every 
physiological and cognitive function. Fos- 
ter explains these complex relationships 
in simple but always scientifically correct 
language, highlighting the ubiquity of circa- 
dian regulation and the enormous impact it 
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Night work fails to account for our bodies’ natural 
rhythms and can lead to sleep and health problems. 


has on metabolism, as well as on immunity 
and cognitive, cardiovascular, and kidney 
function (among other things). He spares 
the reader the molecular details, paraphras- 
ing instead by saying that a clock anticipates 
events to ensure “the correct materials in 
the right place, in the right amount, at the 
right time of day.” 

But what happens when this system gets 
out of balance, for example, when one works 
nights, frequently travels between time 
zones, or gets up earlier than one’s biological 
clock dictates? The most direct consequence 
is often sleep problems, but given the strong 
linkage between biological rhythms and 
physiological function, such disruptions can 
also have longer-term health consequences. 
Here, Foster describes how circadian dis- 
ruptions can contribute to or exacerbate 
common diseases such as schizophrenia, dia- 
betes, obesity, multiple sclerosis, migraine, 
infectious diseases, and cancer. 

Many conditions themselves perturb 
sleep quality and circadian rhythms, and a 
number of disorders are characterized by 
time-of-day-dependent symptoms. Foster 
notes that better-timed medications could 
materially improve patient outcomes in the 
latter category. Together, however, such con- 
ditions illustrate how sleep and circadian 
disruption is both a potential risk factor for, 
and a possible symptom of, disease. 

It is a distinct merit of Life Time that the bi- 
ology of circadian clocks and that of sleep are 
treated as they are: intrinsically intertwined 
fields of science. Nevertheless, we need more 
research to disentangle the two phenomena, 
where possible, to determine which health 
problems are caused primarily by poor or in- 
sufficient sleep and which are caused by dis- 
ruption of the temporal physiological 
structure. Foster elides these knowl- 
edge gaps by simply subsuming all 
forms of sleep and rhythm problems 
under the term “sleep and circadian 
rhythm disruption.” 

Using relatable examples and 
tools to determine our own bio- 
logical rhythms, Foster highlights 
aspects of our daily routines that 
we can target to improve our sleep 
quality and overall well-being. 
But he does not leave it at recom- 
mendations for individuals. He advocates 
forcefully for societal change, especially 
for better education on sleep and circadian 
rhythms, better physician training, the abo- 
lition of daylight saving time, later school 
start times for teenagers, and more. 
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Longevity lessons 


Animal aging could hold clues to healthier human life spans 


By Charles Brenner 


eaders of several recently published 

books about longevity have been 

told that human aging is a solved 

problem. One recent book argues, 

for example, that dominantly acting 

longevity genes, identified in yeast, 
are conserved in animals including humans 
and can be activated by natural products 
such that we do not have to age (J). Given 
that the most powerful animal “longevity 
genes” are actually recessive alleles of genes 
required for growth and fertility, the 
certainty with which such claims 
have been made is astonishing. 

It was therefore refreshing to read 
Methuselah’s Zoo, the latest book 
from biologist Steven Austad, who 
expresses with clarity what we know 
and what we don’t know about aging. 
Austad does not believe that aging is 
a solved problem, citing uncertainty 
about phenomena as fundamental as 
the timing of menopause with respect 
to women’s life span. However, he be- 
lieves that by dissecting the mecha- 
nisms by which particular animals 
age well, we will be able to develop 
medicines that promote longevity in 
the future. 

Indeed, Austad is so confident of 
this assertion that in 2001, he bet 
gerontologist Jay Olshansky that the 
first person who will live to 150 had 
already been born. Their heirs will 
settle this wager no later than 2150. 

Definitions matter in aging, so I ap- 
preciated that Austad establishes some 
boundaries early on. While he tips his 
hat to trees that are thousands of years 
old, he reasons that the ability of plants 
to expand clonally puts them in a different 
category than nearly every animal. 

A former professional lion trainer and 
someone who spent decades studying ani- 
mals in wild habitats, Austad introduces the 
longevity quotient (LQ) as a general system 
of classification that indicates whether a 
species’ life span is above or below a trend 
line that adjusts for its size. (Big animals 
generally live longer than little animals, so a 
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species’ absolute life span does not capture 
whether it really is adept at aging.) 

Scoring longevity on a size-normalized 
scale thus allows Austad to characterize 
mice as poor agers and to identify tuataras, 
Greenland sharks, rougheye rockfish, and 
olms (a type of salamander) as extraordi- 
narily good agers. Although their estimated 
LQ is not as high as these four species, 
naked mole rats appear to age better than 
humans and, indeed, they are the only ani- 
mal model organism to have a higher LQ 
than humans. (Note that the longest-lived 


Wisdom, the oldest known wild bird, is shown just shy of 68 years old. 


whales, which outlive humans by a century, 
have a lower LQ than humans because their 
size goes in the denominator.) 

Having established these terms, Austad 
argues that it is unfortunate that we depend 
on the laboratory mouse as much as we do 
to learn how to extend a healthy life span. 
He quips that humans have much to teach 
mice about longevity, rather than vice versa, 
and suggests broadening geroscience to 
animals with higher LQs in order to iden- 
tify ways that organisms are able to remain 
cancer-free, physically active, and mentally 
astute into later decades. Cellular or organ- 
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oid models of tissue resiliency and repair 
could potentially make up for our inability 
to cultivate the longest-lived animals as lab- 
oratory aging models, he argues. 

I agree that we will find mechanisms of 
resiliency in animals of high LQ, but I would 
make a distinction between extended life span 
and extended health span. In large part, the 
selective pressures that formed animal gene 
sets are bottlenecks that optimized successful 
reproduction, which integrates many complex 
functions, including the ability to acquire 
food, avoid predation, attract a mate, and— 
in some cases—care for offspring until 
they can accomplish these functions 
themselves. These abilities require 
thousands of gene functions in the ner- 
vous, musculoskeletal, respiratory, and 
circulatory systems, and more. 

Because animals that succeed in 
reproducing multiple times pass on 
the most genes, there is an indirect 
selection for longevity. However, 
clever as evolution is, it doesn’t pro- 
duce something for which there isn’t 
a selection, namely postreproductive 
vigor, unless, for example, grand- 
mothering sufficiently improves the 
reproductive success of an individu- 
al’s offspring, as has been suggested 
in human longevity research (2). 

If I were refereeing a debate be- 
tween Austad and Olshansky, I’d imag- 
ine Olshansky would argue that any 
potential longevity medicine is more 
likely to help more people get to 100- 
plus years—that is, closer to a geneti- 
cally encoded longevity maximum— 
than to extend the observed maximum 
(3). In this case, the mouse being a 
poor ager might make it a good model 
for health-span research, with end 
points such as improved wound healing and 
infection resistance in 20-month-old mice 
rather than extending the average life span 
of a mouse from 2 years to 2.5. If, however, 
we someday derive longevity medicine from 
the cave-dwelling olm, Steven Austad will 
deserve an assist. 


REFERENCES AND NOTES 
1. D.A. Sinclair, M.D. LaPlante, Lifespan: Why We Age— 
and Why We Don’t Have To (Atria Books, 2019). 
2. K.Hawkes, J. E.Coxworth, Evol. Anthropol. 22, 
294 (2013). 
3. S.J.Olshansky, JAMA 320, 1323 (2018). 


10.1126/science.add9130 


science.org SCIENCE 


PHOTO: MADALYN RILEY/USFWS VOLUNTEER/FLICKR 


science.org/journal/stm 


PUT HUMAN 
HEALTH AT 

THE HEART 
OF YOUR ~ 
RESEARCH ., 


Submit your research: 
cts.ScienceMag.or 


1 


4 


: 
\ 


Science 
Translational 
Medicine (7 Twitter: @ScienceTM 


MVAAAS Ea Facebook: @ScienceTranslationalMedicine 


INSIGHTS 


LETTERS 


Edited by Jennifer Sills 


Protect Brazil’s 
overlooked Pampa biome 


Brazil’s Pampa biome is characterized 
by grassland vegetation and covers half 
of Rio Grande do Sul, the southernmost 
state in Brazil (1), bordering Argentina 
and Uruguay. The Pampa is the biome at 
the most risk in Brazil according to the 
Conservation Risk Index (7), and recent 
data point to an alarming 92% increase 
in the loss of Brazilian Pampa vegetation 
in 2021 compared to data from 2020 (2). 
Brazil must increase conservation efforts 
in this overlooked biome. 

Although the Pampa does not support 
striking arboreal vegetation as seen in 
Brazil’s forests, the region contains sub- 
stantial biodiversity and performs critical 
ecological functions. Just 1 m?in the Pampa 
can harbor more than 50 plant species (3). 
This biome is home to 476 bird species (1) 
and serves as a breeding and resting place 
for migratory birds (4). The Pampa contrib- 
utes to CO, capturing, climate and water 
cycle regulation, pollination, and food pro- 
duction, among other benefits to humans 
and animal populations (J, 4, 5). 

Maintaining the Pampa’s biodiversity 
also contributes to human health. High 
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biodiversity reduces, or “dilutes,” infec- 
tious zoonotic disease risk (6, 7). Access to 
green spaces promotes mental health (8). 
Exposure to the ecosystem’s antigens and 
microorganisms is important for immune 
development and healthy microbiota, 
respectively, and the vegetation helps 
remove air pollutants and contain noise 
pollution (8). 

The Pampa’s ecosystems are neglected 
in Brazil’s environmental protection poli- 
cies. Only about 1% of the biome is under 
protection, the smallest protected area of 
any biome in Brazil (5). Approximately half 
of natural Pampa vegetation has been lost, 
mostly to be replaced by pasture and cash 
crops (5, 9). Pesticides used on the crops 
have contaminated nearby rivers and soil, 
threatening humans and native species (0, 
11). Invasive species, mining projects, and 
urban expansion (9) put the Pampa’s biodi- 
versity under increasing risk. 

Recently, 19 Brazilian entities pub- 
lished 10 guidelines to guarantee the 
Pampa’s protection, including planning 
the occupation and use of the biome’s 
land, establishing sustainable agricultural 
and livestock production chains in the 
Pampa, and regulation and protection of 
traditional territories (12). Brazil should 
work to implement these suggestions. In 
addition, more research and conserva- 
tion efforts should focus on the Pampa’s 


Brazil’s vulnerable Pampa biome hosts biodiversity 
and provides ecosystem services. 


animal biodiversity, including increased 
efforts to prevent poaching. Also, invasive 
species in the region must be better con- 
trolled. Recognizing and protecting South 
American grasslands diversity is funda- 
mental to safeguarding environmental 
and human health. 
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Brazil’s Pantanal 
threatened by livestock 


Brazil’s Pantanal region is a UN 
Educational, Scientific, and Cultural 
Organization World Heritage Site 
renowned for its spectacular wildlife (7). 
The vast wetland faces threats ranging 
from record-breaking fire outbreaks (2) 
to waterway plans that would dredge its 
main river and reduce water levels in the 
ecosystem (1). Cattle ranching is an ever- 
increasing threat to the Pantanal (3, 4), 
and a new bill has just been approved by 
the state of Mato Grasso that will open 
the protected portions of ranches within 
the Pantanal to use for livestock. If the 
policies in this bill are implemented, 
the changes will be devastating to the 
Pantanal’s biodiversity and ecosystems. 
On 29 June, the Mato Grosso Legislative 
Assembly approved Bill (PL) No. 561/2022 
(5), which would allow cattle in the 
“permanent preservation areas” (APPs) 
and “legal reserves” (RLs) required by 
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Brazil’s forest code (Law 12.727/2012), as 
well as in all other areas in the Pantanal 
designated for permanent conservation. 
Previously, cattle were only allowed in 
seasonally flooded grasslands in this part 
of Mato Grosso (6). Opening protected 
land for cattle is incompatible with the 
goals of protected status. Cattle ranching 
is extremely harmful to protected lands, 
with the capacity to change species com- 
position and richness (7). 

Fallacious arguments have been used to 
expand livestock in the Pantanal, such as 
President Jair Bolsonaro’s statements that 
cattle help to prevent fire. Such claims 
have no scientific basis, in contrast to 
the well-documented damage caused by 
cattle. The presence of cattle in protected 
areas kills trees, fragments habitats, and 
causes erosion that results in siltation of 
water bodies (7). Degradation of the veg- 
etation in these areas makes them more 
susceptible to fires (8). These fires are 
expected to increase further with global 
warming and could take the Pantanal 
beyond its threshold for tolerating degra- 
dation, resulting in collapse of this impor- 
tant region’s ecosystem services (2). 

The Pantanal is also home to 
Indigenous peoples who would be affected 
by the newly approved law (6). These peo- 
ple have a right to free and informed con- 
sultation in advance about activities that 
may affect their territory and way of life, 
as established by International Labour 
Organization Convention 169. Although 
Brazil signed and ratified this convention, 
violations are frequent (9). 

This law sacrifices Brazil’s people and 
ecosystems in the name of profits. The 
country has idle areas with the capacity 
to expand its livestock production with- 
out using any additional protected areas 
with native vegetation (10). Moreover, the 
expansion of ranching allowed by this bill 
would increase exports rather than meat 
available for domestic consumption (77). 

A complaint has been filed with the 
Federal Public Ministry recommend- 
ing the immediate suspension of PL No. 
561/2022 (12). Brazilian courts must base 
their decision on technical and scientific 
information and immediately suspend 
this new law. In addition, countries that 
import commodities from Brazil need to 
assess the impacts of this bill and review 
their imports of Brazilian beef. 
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Tech companies liable for 
illegal wildlife trade 


In a move heralded by conservationists 
and tech policy reformists alike, Brazil’s 
environmental regulatory agency has 
penalized Meta, the parent company of 
Facebook and WhatsApp, for failing to 
remove thousands of online posts that had 
been flagged for selling illegal wildlife (7). 
The ~US$2 million fine will not materi- 
ally affect the world’s largest social media 
company, which in 2021 earned more than 
US$117 billion in global revenue (2), but 

it was nonetheless an important salvo 

in the fight to hold Meta responsible for 
vast amounts of illegal wildlife trading 
occurring on its platforms (3). To the best 
of our knowledge, the July ruling marked 
the first time any tech company has been 
sanctioned for hosting wildlife crime. The 
decision should set a precedent for hold- 
ing tech companies accountable. 

The fine was imposed after Brazilian 
conservationists provided authorities with 
evidence that illegal wildlife traders oper- 
ated groups on Facebook and WhatsApp 
and that Meta had failed to remove the 


content after being made aware of it, 
claiming it did not violate its community 
standards (1). Meta’s response was typical. 
Although most tech companies officially 
forbid the trading of endangered spe- 
cies on their platforms (4), most parts of 
the world provide tech platforms with 
immunity for user-generated content, 
even when the activity is locally illegal. 
As a result, many tech companies do not 
enforce their own rules. 

Legal reform is on the horizon. The 
Digital Services Act agreed upon by the 
European Union in April (5) and the 
UK’s Online Safety Bill currently under 
debate (6) will impose a duty of care on 
tech platforms to restrict and remove 
organized crime activity on their sys- 
tems. We support these reforms and urge 
other countries to adopt similar laws. 
Cybercrime legislation should explicitly 
address illegal wildlife trading, which is 
now largely dependent upon social media 
(3, 7). Failure to comply should result in 
substantial fines, as modeled by Brazil. 
Only successive financial sanctions and 
public pressure will incline noncompliant 
companies to invest in effectively combat- 
ing wildlife crime. 
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Divergent species create distinct hybrids 


ybridization between species is common in many taxa and can lead to a variety of out- 

comes. Hybrids can backcross with parent lineages, increasing genetic diversity, or they 

can develop into new species. In squamate lizards, hybridization can create polyploid 

species with unisexual reproduction (parthenogenesis). Barley et al. used North American 

whiptail lizards, which include 15 recognized unisexual species, as a model system to 
examine the possible outcomes of hybridization. Phylogenetic analyses showed that divergence time 
between parents predicts the effects of their hybridization. Hybrid speciation and unisexuality 
were more common with more time since the parent species diverged from a common ancestor. 
—BEL Science, abn1593, this issue p. 773 


A study of many species of North American whiptail lizards, such as the individual pictured above, 
identifies outcomes of hybridization. 
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Blocking a range of 
coronaviruses 


Seven coronaviruses cause 
human disease, and three have 
caused serious outbreaks in the 
past 20 years. The potential for 
future coronavirus outbreaks 
in humans and the ongoing 
resistance of severe acute respi- 
ratory syndrome coronavirus 2 
(SARS-CoV-2) variants to existing 
antibodies make it important to 
identify cross-reactive antibodies 
that can be the basis of thera- 
peutics and guide vaccine design. 
Low et al. and Dacon et al. isolated 
antibodies from convalescent 
individuals that show broad neu- 
tralizing activity against a range of 
coronaviruses, including Omicron 
variants of SARS-CoV-2. The anti- 
bodies target a conserved region 
of the viral spike protein known as 
the fusion peptide and may act by 
preventing the cell fusion that is 
required for infection of new host 
cells. -VV 

Science, abq2679, abq3773, 

this issue p. 735, p. 728 


Concerted proton and 
energy transfer 


Proton-coupled electron transfer 
(PCET) reactions have gener- 
ated considerable interest 
because of their essential role 

in various energy conversion 
processes in biology. Pettersson 
Rimgard et al. report another 
type of mechanism denoted as 
proton-coupled energy transfer 
(PCEnT), in which proton transfer 
is coupled to electronic excita- 
tion energy transfer. PCEnT was 
experimentally detected when 
analyzing the excited state 
behavior for a series of anthra- 
cene-phenol-pyridine trimers 

in low-temperature conditions, 
where PCET is thermody- 
namically hindered. Theoretical 
calculations showed that the 
observed PCEnT is a non- 
adiabatic singlet-singlet energy 
transfer coupled to proton 
tunneling, which, unlike PCET, 
occurs with no charge transfer 
between donor and acceptor. 
PCEnT is potentially important 
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for light-activated chemistry, pho- 
tonic materials, and photobiology, 
but it has yet to be identified for 
natural systems. —YS 

Science, abq5173, this issue p. 742 


Generating electron- 
photon pairs 


The interaction of electron 
beams with cavities and resonant 
structures represents a universal 
scheme for generating electro- 
magnetic radiation. Feist et al. 
fabricated structures with phase- 
matched interactions between 
free electrons and the vacuum 
cavity field of a photonic chip- 
based microresonator. As the 
electrons passed near the resona- 
tor, coupling between them and 
the vacuum field resulted in the 
spontaneous generation of pho- 
tons within the cavity. Because 
the electron-photon pairs are 
correlated, they should be a use- 
ful source for the development 
of free-electron quantum optics 
providing enhanced imaging and 
sensing capabilities. —ISO 
Science, abo503/, this issue p.777 


When active and passive 
fluids interact 


Incompatible liquids such as oil 
and water will phase separate 
with low interfacial tension. 
Adkins et al. investigated the 
dynamics of a one-dimensional 
interface separating an active 
nematic phase with a passive iso- 
tropic phase (see the Perspective 
by Palacci). They found a rich 
behavior of fluctuating interfaces 
in which the phase-separating flu- 
ids could form active emulsions 
that did not coarsen and in which 
droplets formed spontaneously. 
Macroscopic interfaces can also 
display propagating waves with a 
characteristic wave number and 
speed. Furthermore, the activity 
of one of the fluids, in which the 
addition of energy drove the 
ordering of that fluid, was able to 
modify the wetting transitions. 
The authors also observed active 
wetting of a solid surface whereby 
active extensile stresses parallel 
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to the surface drove the fluid to 
climb a solid wall against gravity. 
—MSL 
Science, abo5423, this issue p. 768; 
see also adc9202, p. 710 


Sniffing out odorant 
receptors 


Odorant receptors (ORs) are 
G protein-coupled receptors 
(GPCRs) found in the cilia of olfac- 
tory sensory neurons (OSNs) in 
the nasal epithelium. Because 
only one OR is usually present in 
an OSN, there are too few ORs 
to enable identification of the 
specific odor that activates an OR 
in vivo. Omura et al. developed a 
platform to profile human ORs 
ex vivo. Cilia were isolated from 
OSNs of mice expressing a spe- 
cific human OR and were found 
to contain all of the components 
of the OR signaling pathway. 
The cilia were screened for their 
responses to potential ligands 
in both liquid and vapor phases. 
—JFF 

Sci. Signal. 15, eabm6112 (2022). 


Dual protection 
Most of the world’s primate 
species are threatened with 
extinction due to climate change 
and reduction of their natural hab- 
itats caused by forest clearance 
for agriculture, infrastructure 
development, and industrial-scale 
resource extraction. These fac- 
tors also pose significant threats 
to the lifeways and cultures of 
the Indigenous peoples who 
live in these areas. Estrada et al. 
conducted an extensive review 
of the literature of the global dis- 
tribution of primate species that 
included a spatial analysis of their 
current habitats and the degree 
of threat they face. The authors 
found that as lands managed 
by Indigenous peoples increase, 
threats of extinction to primate 
species decrease. They propose 
that the national governments 
of these areas promote policies 
that protect both primates and 
Indigenous lifeways. -—MSA 
Sci. Adv. 10.1126/ 
sciadv.abn2927 (2022). 


CANCER MICROBIOME 


Edited by Caroline Ash 
and Jesse Smith 


Diarrhea bug linked to cancer 


olorectal cancer (CRC) is a leading cause of 

death, with incidence rates that have sharply risen 

in recent years. Drewes et al. found that a bacterium 

called Clostridioides difficile, which is commonly associ- 

ated with infectious diarrhea in humans, may also be 
involved in the development of CRC. Bacterial isolates were 
obtained from human CRC specimens and tested to see if 
they could promote tumors when inoculated into mice. The 
toxigenic version of C. difficile that causes human diarrhea 
did trigger tumor formation. When the researchers inacti- 
vated the TcdB toxin made by the bacteria, they observed 
an impaired ability to form tumors, suggesting that TcdB 
exposure is key to the formation of CRC in the model. —PNK 
Cancer Discov. 10.1158/2159-8290.CD-21-1273 (2022). 


A diarrhea-causing bacterium, Clostridioides difficile, shown here in 
a colored transmission electron micrograph, is also associated with 
colorectal cancer. 


Admixture reveals 
epistatic effects 


Variants found in genetic stud- 
ies of disease replicate poorly 
across populations. Explanations 
range from local genetic context 
to epistatic interactions with 
other genes or the environ- 
ment. Patel et al. tested whether 
epistasis influences the effects of 
variation in regions of European 
ancestry that are shared among 
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Americans of different ethnicity. 
If local genetic context is the key 
factor determining a variant’s 
effect size, then it should act 
similarly in European Americans 
and African Americans with 
European ancestry at or around 
that variant. However, for both 
low-density lipoprotein choles- 
terol and gene expression in an 
atherosclerosis study, these vari- 
ants were found statistically to 
have dissimilar effects between 
the groups. These results indicate 
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that gene-by-gene or gene-by- 
environment interactions modify 
the effect size of variants associ- 
ated with complex human traits. 
—CNS 

Am. J. Hum. Genet. 109, 1286 (2022). 


OPTICS 
Limits on space 
compaction 


Optical devices such as 
telescopes, microscopes, and 
cameras contain three main 
components: lenses, detectors, 
and space. Although metalenses 
and plasmonic sensors have 
shrunk the physical size of 
lenses and detectors, the space 
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between them has remained 
largely the same. However, 
the development of “space- 
plates,” a slab of lower-index 
material placed between the 
lens and detector, can reduce 
this separation, effectively 
compressing space. Shastri 
et al. theoretically studied the 
bounds and limitations of space 
compression, finding that there 
are trade-offs to be made in 
the bandwidth, the numerica 
aperture, and the extent of com- 
pression. The findings should 
help guide development of 
space-compressing spaceplates 
and the further miniaturization 
of optical devices. —ISO 

Optica 9, 738 (2022). 
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SUPERCONDUCTIVITY 
Charge order ina 
cuprate look-alike 


Infinite-layer nickelates, which 
were recently found to be 
superconducting, have striking 
structural similarities with the 
still-enigmatic cuprate super- 
conductors. However, how far 
the similarities go in terms of the 
phases that they exhibit is still 
an open question. Rossi et al. 
used resonant x-ray scattering in 
the nickelate family La,_Sr,NiO, 
to detect an incommensurate 
charge order that is known to be 
ubiquitous in the cuprates. The 
ordering was strongest in the 
parent compound, and its doping 


dependence suggests that it is 
caused by electronic correlations. 
JS 
Nat. Phys. 10.1038/ 
s41567-022-01660-6 (2022). 


OCEAN MIXING 


All mixed up 
Mesoscale ocean eddies have 
significant impacts on general 
circulation and mixing. Although 
their behavior and effects are 
clearly observable, how to 
model them more accurately 
is a continuing challenge for 
physical oceanographers. Radko 
and Kamenkovich incorporated 
an old idea put forth by James 
Clerk Maxwell about the kinet- 
ics of molecular dispersion 
into numerical simulations of 
mesoscale ocean eddies and 
show that this results in clear 
improvements in eddy represen- 
tation. This approach thus offers 
a basis for the development of 
better mixing parameterizations 
in models of physical ocean 
processes. —HJS 
Geophys. Res. Lett. 
10.1029/2022GL099670 (2022). 


IMMUNOMETABOLISM 
B cells and obesity? 
You (T-)bet! 
Previous work has implicated 
B cells in obesity-associated 
chronic inflammation and 
metabolic syndrome. Hagglof et 
al. report that B cells expressing 
the transcription factor T-bet 
expand in the adipose tissue 
of obese mice and humans. 
Expansion of T-bet* B cells is 
supported by the production 
of inflammatory cytokines, 
including interferon gamma and 
interleukin-21, by an innate- 
like population of T cells called 
invariant natural killer T cells. 
Once activated, T-bet* B cells 
secrete the chemokine CXCL10 
and IgG2c antibodies, which 
contribute to macrophage 
recruitment and the aggravation 
of metabolic syndrome. T-bet* B 
cells could thus serve as a tar- 
get for treating the symptoms of 
obesity. —STS 
Cell Metab. 10.1016/ 
j.cmet.2022.07.002 (2022). 


science.org SCIENCE 


PHOTO: ANDRE SEALE/VW PICS/UNIVERSAL IMAGES GROUP/GETTY IMAGES 


RESEARCH 


ALSO IN SCIENCE JOURNALS é feasiiiiciadleick 


MOLECULAR BIOLOGY 
Driving translation 
in spermatids 


Translation control becomes 
a key mechanism for gene 
regulation during postmeiotic 
male germ cell differentiation 
because of the uncoupling 
between transcription and 
translation. Kang et al. identi- 
fied a member of the fragile 
X-related (FXR) protein family, 
FXRI1, as a translation activator 
for stored mRNAs to instruct 
spermiogenesis in mice (see 
the Perspective by Ramat and 
Simonelig). Mechanistically, 
highly elevated FXR1 in late 
spermatids forms condensates 
through phase separation to 
organize the stored mRNAs into 
FXRI1 granules and then recruits 
the translational machinery 
to activate translation. Either 
germline-specific depletion of 
Fxrl or modeling a phase sepa- 
ration—deficient mutation in 
Fxr1 similarly causes spermato- 
genic failure and male infertility 
in mice. —DJ 

Science, abj6647, this issue p. 727; 

see also add6323, p. 712 


PLANT SCIENCE 
Synthetic gene circuits 


Synthetic engineering offers 
the opportunity to build new 
functional circuits into exist- 
ing developmental programs. 
Brophy et al. have now designed 
a collection of synthetic regula- 
tory elements that they can use 
to control gene expression (see 
the Perspective by Alamos and 
Shih). A design-build-test strat- 
egy optimized function. With 
these synthetic regulatory ele- 
ments, the authors redesigned 
root development in the model 
plant Arabidopsis, quantitatively 
controlling lateral root density. 
The approach developed here 
may be useful for generating 
precise changes in a variety of 
phenotypes and organisms. 
—PJH 

Science, abo4326, this issue p. 747; 

see also add6805, p. 711 
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BIOMATERIALS 
Asound way to make 
it stick 
Tissue adhesives play a role in 
temporary or permanent tissue 
repair, wound management, 
and the attachment of wear- 
able electronics. However, it 
can be challenging to tailor the 
adhesive strength to ensure 
reversibility when desired and 
to maintain permeability. Ma 
et al. designed hydrogels made 
of polyacrylamide or poly(N- 
isopropylacrylamide) combined 
with alginate that are primed 
using a solution containing 
nanoparticles of chitosan, 
gelatin, or cellulose nanocrys- 
tals (see the Perspective by Es 
Sayed and Kamperman). The 
application of ultrasound causes 
cavitation that pushes the 
primer molecules into the tissue. 
The mechanical interlocking of 
the anchors eventually results 
in strong adhesion between 
hydrogel and tissue without the 
need for chemical bonding. Tests 
on porcine or rat skin showed 
enhanced adhesion energy and 
interfacial fatigue resistance with 
on-demand detachment. —MSL 
Science, abn86939, this issue p. 751; 
see also abq/021, p. 707 


ORGANIC CHEMISTRY 
A fluorinated cube 


The synthesis of a cube-shaped 
hydrocarbon nearly 60 years ago 
was a major event in chemistry, 
both because of the compound's 
beautiful, high symmetry and 
its distinctly unnatural bonding 
geometry. Sugiyama et al. have 
now synthesized and structurally 
characterized a cubane deriva- 
tive in which the hydrogen atoms 
at each vertex are replaced by 
fluorines (see the Perspective by 
Krafft and Riess). Consistent with 
theoretical predictions, low-tem- 
perature electron spin resonance 
spectroscopy suggests that the 
molecule centrally internalizes an 
electron upon reduction. —JSY 
Science, abqO516, this issue p. 756; 
see also adc9195, p. 709 


SPEECH EVOLUTION 
Complexity from 
simplification 
Human speech and language 
are highly complex, consisting 
of a large number of sounds. 
The human phonal apparatus, 
the larynx, has acquired the 
capability to create a wider array 
of sounds, even though previous 
work has revealed many simi- 
larities between our larynx and 
those in other primates. Looking 
across a large number of pri- 
mates, Nishimura et al. used a 
combination of anatomical, pho- 
nal, and modeling approaches to 
characterize sound production in 
the larynx (see the Perspective 
by Gouzoules). They found that 
instead of the human larynx 
having increased complexity, it 
has actually simplified relative 
to other primates, allowing for 
clearer sound production with 
less aural chaos. —SNV 

Science, abm15/74, this issue, p. 760; 

see also add6331, p. 706 


SEISMOLOGY 
Large-scale learning 


The large amount and availability 
of datasets in seismology create 
a great opportunity to apply 
machine learning and artificial 
intelligence to data processing. 
Mousavi and Beroza provide a 
comprehensive review of the 
deep-learning techniques being 
applied to seismic datasets, 
covering approaches, limitations, 
and opportunities. The trends 
in data processing and analysis 
can be instructive for geoscience 
and other research areas more 
broadly. —BG 

Science, abm4470, this issue p. 725 


MICROBIOLOGY 
STAND against 
viral invaders 


The innate immune systems 

of animals, plants, and fungi 
universally use nucleotide bind- 
ing oligomerization domain-like 
receptors (NLRs) of the STAND 
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superfamily to detect molecular 
patterns common to pathogens. 
Gao et al. show that NLR-based 
immune pattern recognition is 
also prevalent in bacteria and 
archaea, something that was 
not known before. In particu- 
lar, the authors characterized 
four families of NLR-like genes, 
finding that they are specific 
sensors for two highly con- 
served bacteriophage proteins. 
Upon binding to the target, 
these NLRs activate diverse 
effector domains, including 
nucleases, to prevent phage 
propagation. These findings 
demonstrate that pattern rec- 
ognition of pathogen-specific 
proteins is acommon mecha- 
nism of immunity across all 
domains of life. —DJ 

Science, abm4096, this issue p. 726 


INSECT MIGRATION 
Staying on course 


We still know little about how 
many migrating species navi- 
gate across vast distances. This 
is especially true for inverte- 
brates, which are challenging to 
monitor. However, new tech- 
nologies leading to extremely 
light, animal-mounted tags 
are opening up new research 
avenues in this area. Menz et 
al. used such tags to track the 
flight of death’s-head hawk- 
moths that migrate between 
Europe and sub-Saharan Africa. 
They found that the moths 
were able to correct to their 
specific course even in the face 
of disruptive winds and high 
mountains. This work suggests 
that the moths are not merely 
passively moving in the right 
direction, but instead actively 
navigating based on an internal 
map or compass. —SNV 
Science, abn1663, this issue p. 764 


TUMOR IMMUNOLOGY 
Do not feel excluded 


Chimeric antigen receptor 
(CAR)-T cells have revolution- 
ized the treatment of blood 
cancers, but the selection of 
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target antigens is hampered by 
the lack of understanding about 
how CARs are activated. Xiao 
et al. tested the size-exclusion 
hypothesis, whereby the bulky 
tyrosine phosphatase CD45, 
which is present in the immune 
synapse, inhibits CAR-T cell 
activation. They found that 
lengthening CARs decreased 
CD45 exclusion and CAR-T 
cell activation, whereas using 
larger CD45s in CAR-T cells 
or shortening target antigens 
led to increased CD45 exclu- 
sion and CAR-T cell activation. 
Lengthening CARs abrogated 
the in vivo efficacy of CAR-T 
cells in a mouse tumor model. 
Thus, manipulating the ability 
of CAR-T cells to better exclude 
CD45 from the immune syn- 
apse could improve CAR-T cell 
therapy. —DAE 

Sci. Immunol. 7, eabl3995 (2022). 


PROGERIA 
Understanding premature 
aging syndromes 
Heterochromatin suppresses 
repetitive and transposable 
elements, including long 
interspersed nuclear element 
(LINE) retrotransposons, but 
is lost during aging, lead- 
ing to derepression of these 
LINE elements. However, the 
consequences of this derepres- 
sion are not fully understood. 
Della Valle et al. studied LINE-1 
in typical and atypical progeroid 
syndromes. Nuclear LINE-1 
RNA expression occurred early 
in progeroid cells, resulting in 
suppression of the histone- 
lysine N-methyltransferase 
SUV39HI1, leading to hetero- 
chromatin loss and senescent 
phenotypes. Using antisense 
oligonucleotides to deplete 
LINE-1 RNA restored hetero- 
chromatin epigenetic marks 
and decreased the expression of 
senescence-associated genes 
in human cells and ina mouse 
model of Hutchinson-Gilford 
Progeria syndrome, resulting 
in increased life span. These 
findings suggest LINE-las a 
therapeutic target for prema- 
ture aging syndromes. —MN 
Sci. Transl. Med. 14, eabl6057 (2022). 
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SEISMOLOGY 


Deep-learning seismology 


S. Mostafa Mousavi* and Gregory C. Beroza 


BACKGROUND: Seismology is the study of seis- 
mic waves to understand their origin—most 
obviously, sudden fault slip in earthquakes, 
but also explosions, volcanic eruptions, glaciers, 
landslides, ocean waves, vehicular traffic, air- 
craft, trains, wind, air guns, and thunderstorms, 
for example. Seismology uses those same waves 
to infer the structure and properties of plane- 
tary interiors. Because sources can generate 
waves at any time, seismic ground motion is 
recorded continuously, at typical sampling 
rates of 100 points per second, for three com- 
ponents of motion, and on arrays that can 
include thousands of sensors. Although seis- 
mology is clearly a data-rich science, it often 
is a data-driven science as well, with new 
phenomena and unexpected behavior discov- 
ered with regularity. And for at least some 
tasks, the careful and painstaking work of 
seismic analysts over decades and around 
the world has also made seismology a data 
label-rich science. This facet makes it fertile 
ground for deep learning, which has entered 
almost every subfield of seismology and out- 
performs classical approaches, often dramati- 
cally, for many seismological tasks. 


ADVANCES: Seismic wave identification and 


onset-time, first-break determination for seis- 
mic P and S waves within continuous seismic 


Domain knowledge 


( a _ x ys . Complex mapping 


Data 


data are foundational to seismology and are 
particularly well suited to deep learning be- 
cause of the availability of massive, labeled 
datasets. It has received particularly close 
attention, and that has led, for example, to the 
development of deep learning-based earth- 
quake catalogs that can feature more than an 
order of magnitude more events than are 
present in conventional catalogs. Deep learning 
has shown the ability to outperform classical 
approaches for other important seismological 
tasks as well, including the discrimination of 
earthquakes from explosions and other sources, 
separation of seismic signals from background 
noise, seismic image processing and interpre- 
tation, and Earth model inversion. 


OUTLOOK: The development of increasingly 
cost-effective sensors and emerging ground- 
motion sensing technologies, such as fiber 
optic cable and accelerometers in smart de- 
vices, portend a continuing acceleration of 
seismological data volumes, so that deep learn- 
ing is likely to become essential to seismology’s 
future. Deep learning’s nonlinear mapping abil- 
ity, sequential data modeling, automatic fea- 
ture extraction, dimensionality reduction, and 
reparameterization are all advantageous for 
processing high-dimensional seismic data, 
particularly because those data are noisy and, 
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from the point of view of mathematical in- 
ference, incomplete. Deep learning for scien- 
tific discovery and direct extraction of insight 
into seismological processes is clearly just 
getting started. 

Aspects of seismology pose interesting ad- 
ditional challenges for deep learning. Many of 
the most important problems in earthquake 
seismology—such as earthquake forecasting, 
ground motion prediction, and rapid earth- 
quake alerting—concern large and damaging 
earthquakes that are (fortunately) rare. That 
rarity poses a fundamental challenge for the 
data-hungry methods of deep learning: How 
can we train reliable models, and how do we 
validate them well enough to rely on them 
when data are scarce and opportunities to test 
models are infrequent? Further, how can we 
operationalize deep-learning techniques in 
such a situation, when the mechanisms by 
which they make predictions from data may 
not be easily explained, and the consequences 
of incorrect models are high? Incorporating 
domain knowledge through physics-based and 
explainable deep learning and setting up stan- 
dard benchmarking and evaluation protocols 
will help ensure progress, as is the nascent 
emergence of a seismological data science 
ecosystem. More generally, a combination of 
data science literacy for geoscientists as well 
as recruiting data science expertise will help to 
ensure that deep-learning seismology reaches 
its full potential. m= 
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Deep-learning processing of seismic data and incorporation of domain knowledge can lead to new capabilities and new insights across seismology. 
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SEISMOLOGY 


Deep-learning seismology 


S. Mostafa Mousavi"* and Gregory C. Beroza? 


Seismic waves from earthquakes and other sources are used to infer the structure and properties of Earth’s 
interior. The availability of large-scale seismic datasets and the suitability of deep-learning techniques for 
seismic data processing have pushed deep learning to the forefront of fundamental, long-standing research 
investigations in seismology. However, some aspects of applying deep learning to seismology are likely 

to prove instructive for the geosciences, and perhaps other research areas more broadly. Deep learning is 
a powerful approach, but there are subtleties and nuances in its application. We present a systematic 
overview of trends, challenges, and opportunities in applications of deep-learning methods in seismology. 


eismology is a data-rich and data-driven 

science in which the application of deep- 

learning techniques is growing rapid- 

ly (Fig. 1). Deep learning (7) uses deep 

neural networks (DNNs) to learn com- 
plex relationships, find multiscale structure 
and patterns, and address ill-posed inverse 
problems by developing high-level representa- 
tions of data using stacked layers of neurons 
and multiple nonlinear transformations. The 
abilities of DNN variants—such as convolu- 
tional neural networks (CNNs) to extract multi- 
scale features from data and recurrent neural 
networks (RNNs) to model temporal relations 
in sequential data—make them powerful tools 
for seismology. 

Deep-learning approaches have entered 
almost every subfield of seismology, for which 
they have shown the ability to outperform 
classical approaches, often dramatically, for 
seismological tasks such as denoising, earth- 
quake detection, phase picking, seismic image 
processing and interpretation, and inverse and 
forward modeling. Some properties of DNNs— 
such as their universal approximation capabil- 
ity, automatic feature extraction, and dimen- 
sionality reduction—have been shown to be 
particularly advantageous in processing high- 
dimensional seismic recordings, which often 
are noisy and incomplete (Fig. 2). Seismolog- 
ical deep learning can process massive amounts 
of multifidelity seismic observations with un- 
precedented spatiotemporal coverage and lead 
to new insights and discoveries (2). Deep learn- 
ing may be particularly effective for seismolog- 
ical problems for which the underlying physical 
processes are incompletely understood but for 
which the data are abundant and of high quality. 

We present a systematic overview of deep- 
learning methods in seismology by sampling 
the current state of the art and through a 
meta-analysis of 637 journal papers published 
between January 1988 and January 2022 (3). 
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We conclude with some speculation on future 
trends and with recommendations intended 
to accelerate progress. 


Data processing automation 


Seismic data are recorded (often irregularly 
or heterogeneously) as time series of ground 
motion by sensors that are deployed either 
sparsely (in a network) or densely (in an array) 
and that register acceleration, velocity, or 
displacement as output. Typically, these quan- 
tities are recorded in three perpendicular direc- 
tions, so that we work with three-component 
vector ground motion. Important extensions 
to conventional seismic recording include ro- 
tational motion (4) and tensorial strain (5). 
In both passive and active-source seismol- 
ogy, usually a sequence of processing and 
analyzing steps exists that needs to be per- 
formed. Despite decades of earlier efforts to 


develop algorithms to automate such data- 
processing and -analyzing tasks, many cases 
remain for which manual processing by skilled 
analysts is the most reliable option (such as 
phase picking or seismic interpretation). The 
accelerating expansion of seismic data vol- 
umes poses new challenges and brings with 
it the need to develop a new generation of 
robust processing tools by using data-driven 
approaches such as deep learning. 


Seismic event monitoring 


The classification capability of DNNs makes 
them ideal for processing tasks such as event 
discrimination (6-8), event detection (9, 10), 
phase picking (//, 12), first-polarity determi- 
nation (73, 14), and phase association (15, 16) 
that together form a seismicity monitoring 
workflow. 


Seismic event discrimination 


Various studies have shown outstanding per- 
formance of deep learning for different event 
discrimination tasks, such as differentiating 
between human-made explosions and natural 
earthquakes (6, 7), volcano-tectonic earth- 
quakes and volcanic tremor (8), mining-related 
events (17), tectonic tremor and local earth- 
quakes (18), and teleseismic and local earth- 
quakes (19). The main challenge in developing 
discriminator models is the scarcity of large- 
scale labeled datasets. Efforts to address this 
include semisupervised learning (20), unsuper- 
vised learning (19), and transfer learning (8). 


Earthquake detection and phase picking 


Identification of earthquake signals among 
nonstationary background noise and a variety 
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Fig. 1. A surge of interest in deep-learning applications in seismology. Cumulative number of published 
papers in seismology between January 1988 and January 2022 that used neural networks (red curve). 
Bars show the average number of hidden layers of neural networks—the depth of networks. The color scale 
depicts the average size of training data used to train the networks. 
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Fig. 2. The properties of DNNs for the processing of seismic data. Various properties of DNNs such as their universal approximation capability, automatic feature 
extraction, and dimensionality reduction are advantageous for processing large-volume, high-dimension seismic recordings, which often are noisy and incomplete. 

These attributes are used to build powerful models for various seismological tasks such as (i) seismic data denoising, (ii) interpolation of missing traces, (iii) earthquake signal 
detection and phase picking [reprinted with permission from (43)], (iv) exploratory data analysis [reprinted with permission from (123), copyright Seismological Society 
of America], (v) salt body detection, (vii) seismic velocity model building, (vii) earthquake location estimation, or (viii) ground motion prediction. 


of nonearthquake signals, from both natural 
and anthropogenic sources, as well as precise 
measurement of P wave or S wave arrival 
times, is a challenging task—we differentiate 
between discrimination and the detection task 
in which the goal is to identify only one signal 
of interest among a variety of recorded events. 
The huge archives of seismic data and the avail- 
ability of hand-picked labels from earthquake 
catalogs made these tasks to be among the 
earliest and the most popular applications 
of data-hungry deep learning in seismology. 
The potential of deep learning for these tasks 
was evident from the earliest deep learning- 
based studies (9-17, 21). Since then, more than 
80 models have been developed to explore the 
performance gain obtained through more com- 
plex deep-learning architectures, such as atten- 
tive CNN-RNN (22, 22) and Capsule Networks 
(23). Researchers have also explored the advan- 
tages and disadvantages of performing these 
tasks in the time domain (J0, 17), the time- 
frequency domain (27), and a fusion of time 
and time-frequency (24); of using multistation 
data (25, 26) and multisensor data (GPS and 
seismic); of using two observation levels (seis- 
mic station level and network level) (27); and of 
performing the two tasks sequentially (28, 29) 
or simultaneously (72), on data recorded with 
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low-cost microelectromechanical systems 
(MEMS) accelerometers (30), distributed acous- 
tic sensing (DAS) (37), or ocean-bottom seis- 
mometers (OBSs), and geophones (25). 
Deep-learning detectors and phase-pickers 
can be thought of as universal templates of 
earthquake signals represented by a set of 
kernels of different sizes at different layers of a 
neural network and the connection weights 
between them. The outstanding performance 
of these models comes from their ability to 
learn a deep and compact representation of 
diverse templates; however, because of the lack 
of standardized benchmarking, the optimal 
deep-learning approach for these tasks is un- 
clear. Nevertheless, common observations 
and conclusions are that (i) deep-learning 
models systematically outperform classical 
characteristic function-based models; (ii) deep- 
learning model performance approaches that 
of template-matching methods, but with much 
lower computational cost and no need for tem- 
plate events; (iii) deep-learning models can 
pick phase-arrival times with a similar picking 
error to that of skilled analysts while picking 
far more phases in much less time; (iv) the 
power of artificial intelligence (AI)-based 
pickers is most evident for the challenging task 
of picking S waves that emerge from P coda; 
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(v) deep-learning models can generalize well 
to other regions or data types beyond their 
training data; and (vi) compared with the clas- 
sical methods, deep-learning models perform 
better in conditions with lower signal-to-noise 
ratio (SNR). As a result, deep-learning models 
provide more complete earthquake catalogs 
and much-higher-resolution maps of seismicity 
by detecting more small events. 


Seismic image processing 

Reflected and refracted seismic waves recorded 
by an array of seismic sensors are used to image 
subsurface geology and structure. Deep learn- 
ing has proven an effective tool in the pro- 
cessing steps used to improve the quality of 
seismic images and to transform them into 
an interpretable image of the subsurface by 
removing data acquisition artifacts and wave 
propagation effects to highlight events that 
more accurately portray the true geology and 
structure. 


Seismic denoising 


Recorded seismic data are always contaminated 
by unwanted background noise or interference 
from other signals. This unwanted energy is 
often nonstationary with incoherent (random 
or Gaussian) and/or coherent components 
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that can mask the signal of interest and affect 
subsequent inversion or interpretation. Seis- 
mic denoising (32) and closely related tasks 
such as multiple removal (33) and deblending 
(34) are a focus area for the applications of 
deep learning in seismology. These studies 
have shown the promising performance of 
deep learning for suppressing a wide variety of 
noise in seismic data, including the random 
noise (35), coherent low-frequency noise (36), 
swell noise (37), ground roll (38), and coherent 
and incoherent noise (32) in conventional and 
unconventional seismic data such as DAS (39) 
and in one-dimensional (1D) (32), 2D (40), and 
3D (41) recordings. The effectiveness of deep- 
learning models in suppressing various types 
and levels of seismic noise with low computa- 
tional cost is remarkable. 

In deep-learning denoising, the statistics 
of signal and noise are directly learned from 
training data. As a result, the learned models 
achieve superior performance and are more 
flexible than conventional methods that rely 
on prior assumptions about the statistics of 
the noise or signal. The out-of-distribution 
generalization of their superior performance, 
however, depends highly on how well their 
training data represent the distributions of 
field data. A challenge in building training data 
for deep-learning denoisers is to obtain noise- 
free signals and signal-free noise from field 
recordings. An effective approach to meet this 
challenge is to create semisynthetic data in 
which clean synthetic signals are combined 
with manually picked noise from the field data 
(36, 39). Performing unsupervised denoising 
by using either autoencoders (35, 40) or gen- 
erative adversarial networks (GANS) (36, 38) 
are other common approaches. The idea behind 
autoencoder denoisers is that the most repre- 
sentative features of the data that are automat- 
ically extracted by the autoencoder have a more 
coherent structure and represent signals of 
interest. Hence, autoencoders are usually only 
effective for random noise suppression. Gen- 
erative models, on the other hand, learn the 
domain mapping from noisy data domain 
into the signal domain and could work for 
incoherent noise as well, but require examples 
of clean and noisy data. 


Seismic image acquisition and processing 


Tasks under this heading include identifica- 
tion of shots and receivers with position errors 
(42), interpolation for reconstructing traces 
missing during acquisition (43), extrapolation 
to low-frequency components on the basis of 
high-frequency energy (44) or vice versa (45), 
identification of the first arriving seismic en- 
ergy in time-space domain (46), automation of 
velocity picking (47), extraction and classifica- 
tion of dispersion curves (48), and seismic 
migration. Although velocity picking and 
migration can be treated as an inversion pro- 
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cess, we discuss them here under seismic image 
processing. Tasks in which the main goal is 
the inversion for some physical properties are 
discussed in a subsequent section (49, 50). 
The results of these studies suggest impressive 
capabilities of DNNs in performing subtle and 
complex processing tasks with unprecedented 
computational efficiency. For example, deep- 
learning seismic interpolation methods can 
successfully reconstruct missing regularly or 
irregularly spaced data with flexibility similar 
to that of traditional methods based on wave 
equation and signal processing, but without 
their high computational costs or limiting 
assumptions such as sparsity or low rank. The 
efficiency gain is evident from applications for 
seismic migration. Using a CNN, Liu et al. (49) 
accelerated prestack time migration by more 
than 100 times, and Kaur e¢ al. (50) improved 
both computational efficiency and image reso- 
lution in least-squares migration by estimat- 
ing the inverse Hessian operator using a cyclic 
GAN and eliminating the need for iteration. 


Seismic image interpretation 


Seismic image interpretation aims to extract 
useful information regarding subsurface ge- 
ology (such as rock type) or structure (such 
as fault, horizon, and salt body) from seismic 
images that have limited resolution and are 
contaminated with noise. Many deep-learning 
techniques have been developed to automate 
and assist seismic-image interpretations that 
are subjective and often require substantial time 
and domain knowledge from the interpreter. 

The feature-learning and dimensionality- 
reduction abilities of CNNs are well suited 
for clustering of seismic response in terms of 
geological properties or facies (51, 52) and out- 
perform machine-learning approaches for seis- 
mic facies mapping based on hand-engineered 
features. Deep-learning models, however, have 
the potential to go beyond simple pattern rec- 
ognition to provide accurate end-to-end clas- 
sification results (53-55) while quantifying the 
uncertainties (56). 

Besides geological interpretation, the power 
of deep learning for image segmentation and 
object detection makes it a popular choice for 
structural-interpretation tasks such as auto- 
matic identification of geological channels (57), 
bright spots (58), collapsed karst features (59), 
scatterers (60), bottom-simulating reflections 
(61), salt bodies (62), horizons (63, 64), and 
faults (65-67) in seismic images. 

Seismic image interpretation is a type of 
image understanding. Hence, applications of 
popular neural network architectures devel- 
oped for computer vision are a natural choice. 
The remarkable performance of these methods 
for seismic image interpretation—despite the 
scarcity of large, complete, and precisely la- 
beled training datasets—is attributable to a 
combination of the extensive use of data aug- 
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mentation and the generation of sophisticated 
synthetic data. 


Forward problems 


The impressive predictive power of DNNs 
makes them popular tools for forward mod- 
eling problems in seismology. We outline 
three categories—seismic wave simulation, 
ground motion characterization, and earth- 
quake forecasting—as instructive examples 
of forward-modeling tasks in seismology for 
which DNNs have been used. 


Seismic wave simulation 


To date, applications of deep learning for seis- 
mic wave simulation have been limited. Never- 
theless, deep learning appears to provide an 
effective alternative for standard numerical 
methods and can address issues such as dis- 
cretization errors and high computational com- 
plexity. The use of physics-informed neural 
networks (PINNs) (68-77) and GANs (72-74) 
are two current trends. In PINNs, a DNN is 
trained to learn the solution of the wave equa- 
tion for a medium by implicitly defining the 
boundary conditions and the wave equation 
in the loss function used in training of the 
neural network. These deep-learning models 
can learn to solve the wave equation in 2D or 
3D media, even for complex faulted structure 
or topography, and generalize well beyond the 
time stamps of their training dataset. They are 
much more efficient in computing arbitrary 
space-time points in the wavefield than are 
traditional numerical simulations and reduce 
computation time by at least an order of 
magnitude. 

GAN simulators use the universal function 
approximation ability of DNNs to learn the 
probability distributions of attributes of train- 
ing data by optimizing a generator network. 
This generator model can be used as repa- 
rameterization of such distributions to generate 
new samples drawn from the learned distribu- 
tions during inference. GANs have been used 
to generate synthetic earthquake and non- 
earthquake seismograms as a data augmenta- 
tion tool for training deep-learning earthquake 
detectors (72); to generate broadband seismic 
signals by blending the low-frequency output 
of numerical physics-based simulations with 
sparsely sampled broadband observations (73); 
and to generate 3C strong motion time series 
for different magnitude, distance, and site 
conditions (74). They provide an efficient 
framework for generating large-scale synthetic 
training data to improve the performance of 
deep-learning classifiers and detectors. 


Ground motion attribute modeling 


Another category of forward models provides 
estimates of distinct attributes of ground 
motion caused by a seismic source such as 
peak ground acceleration (PGA) (75, 76) or 
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the travel time of a seismic wave between two 
points in a medium (77). 

Neural networks have been used to build 
nonparametric ground-motion models (GMMs) 
based on a fixed set of input parameters by 
using field (78) or synthetic (79) data. This 
group of AI-based GMMs (which is more than 
15 years old) has similar input-output and 
performance to those of traditional methods; 
however, another group of deep learning- 
based GMMs introduces a more progressive 
generation of GMMs (75, 76, 80) under a new 
paradigm. These models can provide a rapid 
estimation of ground shaking directly from 
observed waveforms without an interpretation 
through earthquake source parameters or site 
conditions. A network-level pattern of ground- 
motion propagation contains enough infor- 
mation for a DNN to nowcast PGA at stations 
that have not been shaken by the earthquake 
waves yet, on the basis of those stations that 
have been shaken (76, 80). This group of end- 
to-end approaches seems particularly relevant 
for earthquake early warning (EEW). 

Deep learning-based Eikonal solvers (77) 
have much lower space and time complexity 
compared with those of the traditional finite- 
difference approaches (especially for complex 
structures) because they eliminate the need 
for travel-time lookup tables and greatly ben- 
efit from the parallel computing capability of 
deep-learning platforms. 


Earthquake forecasting 


Solving the earthquake forecasting puzzle by 
using neural networks is an appealing topic 
(81). Early attempts relied on hand-engineered 
features, but the emphasis has shifted to neural 
network-based forecasting approaches that 
more directly incorporate the complete spatio- 
temporal structure of seismic catalogs (82, 83). 
This exploits the automatic feature extrac- 
tion ability of some DNNs along with their 
potential for learning the dynamics of data 
to take a fresh look at an old problem. Most 
of these studies are in an experimental phase 
and focus on comparing the relative perform- 
ance of (usually not fine tuned) different deep- 
learning and/or classical machine-learning 
predictors. It remains to be established whether 
deep-learning earthquake forecasting models 
offer new insights into earthquake predict- 
ability and whether they perform better than 
traditional empirical approaches (84). 

The rapid progress in deep learning-based 
earthquake monitoring has resulted in deeper 
earthquake catalogs with unprecedented spa- 
tial resolution and magnitude of completeness 
that should improve both traditional statis- 
tical and physics-based forecasting methods 
(85) as well as Al-based approaches. Here too, 
the predictive value of newly uncovered small 
earthquakes in deep learning-based catalogs 
needs to be clarified (86). A key challenge is to 
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effectively explore these complex and denser 
catalogs and extract new relationships. This 
is a task that deep learning can help with as 
well. For example, Zhu et al. (87) present a 
reinforcement-learning approach to model 
complex spatiotemporal relationships in earth- 
quake catalogs that are not possible to mod- 
el by using classical approaches such as the 
epidemic-type aftershock sequences (ETAS) 
method. 

Deep learning offers new opportunities in 
earthquake forecasting by blending different 
potential precursory data and known earth- 
quake physics with seismicity data and un- 
derlying physics that remain unknown. The 
growing number of forecasting competitions 
such as in (88), richterx (89), or AETA (90) is 
important for engaging broader contributions 
beyond the earthquake seismology commu- 
nity. More effort in setting up the technical 
rigor and benchmarking practices is needed to 
direct this new wave of enthusiasm into 
constructive contributions toward earthquake 
forecasting. 


Inverse problems 


Solving often nonlinear and ill-posed inverse 
problems forms another popular category of 
applications of deep learning in seismology. 
Deep learning for estimating physical param- 
eters from observed seismic data can be divided 
into two main categories: (i) characterizing 
Earth’s subsurface properties and (ii) charac- 
terizing the earthquake source. 


Subsurface characterization 


Various characteristics of DNNs have proven 
effective for solving inverse problems in seis- 
mology. Their ability to learn highly nonlinear 
relationships is the main idea behind fully data- 
driven end-to-end approaches for inferring a 
wide variety of subsurface properties such as 
seismic velocity (97-93), impedance (94, 95), 
petrophysical attributes (96, 97), dynamic res- 
ervoir properties (98, 99), groundwater distri- 
bution (700), and anelastic attenuation (J07) 
directly from either pre-stack (92) or post-stack 
(102) seismic data. 

Deep-learning applications for seismic in- 
version are not limited to end-to-end map- 
pings. In conventional seismic inversion, 
deep learning can be used to learn an optimiza- 
tion algorithm (703), to introduce data-driven 
elements into an optimization procedure 
(104), or to replace computationally expen- 
sive steps (105). The representation-learning 
and dimensionality-reduction capabilities of 
DNNs are valuable attributes for seismic inver- 
sion. In full-waveform inversion (FWI), these 
characteristics are used to transform the large 
inverse problems by either skeletonizing the 
data (106) or reparametrizing the Earth model 
(107, 108) as the connection weights and/or 
learnable kernels in a DNN. Although reducing 
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the dimension of observed and synthetic seis- 
mic data by use of DNNs can reduce the non- 
linearity of the inverse problem and the risk of 
converging to a local minimum (J06), the pro- 
vided sparse representation of an Earth model 
in a lower-dimensional latent space reduces 
the computational cost and effectively regu- 
larizes the problem (/03, 109). Representing 
Earth models with sparse multiscale feature 
maps through a DNN acts like a regularization 
technique by constraining the updates to the 
effective dominant features that represent 
the velocity model (107, 109). In addition, the 
differentiable representation of the Earth 
model facilitates the inclusion of physics- 
based constraints (95, 110). The flexibility of 
the open-source deep-learning platforms (such 
as Tensorflow or PyTorch) and their appealing 
computational capabilities (such as automatic 
differentiation and GPU parallel computing) 
motivate reformulating the entire FWI into 
a DNN with forward modeling guided by 
physics laws through neuron computations 
(111). Amore advanced example of this is the 
development of specialized deep-learning ar- 
chitectures that are capable of learning the 
internal mechanism of general wave-based 
inversions (172). 

These studies demonstrate the exceptional 
performance of deep-learning techniques in 
providing fast and reliable solutions to large- 
scale seismic inverse problems. Moreover, the 
processing time for both training and infer- 
ence of deep-learning models can be an order 
of magnitude less than optimization time of 
classical seismic inversion approaches. Deep- 
learning inversion methods also tend to be 
less sensitive to noise (173) and missing low- 
frequency components (770) compared with 
thoses of classic FWI. They either are not sen- 
sitive to the choice of initial velocity models or 
do not require such initialization (97, 93). Last, 
the modularity of deep learning enables the 
processing of multimodal input data such 
as well-log and seismic or gravity and elec- 
tromagnetic data, which facilitates joint inver- 
sion (1/4). 


Earthquake source parameterization 


Deep learning has great potential to provide 
near-real-time earthquake source parame- 
ters, such as hypocenters, magnitudes, and 
source mechanisms. Deep learning is effective 
for end-to-end learning of earthquake source 
properties based on waveform patterns prop- 
agating across a network of seismic stations 
(115-117). Tt can also be used to go beyond con- 
ventional multistation approaches by character- 
izing the source parameters from single-station 
data (118, 119). 

Although multistation approaches are aligned 
with conventional methods, to date they suffer 
from smaller training datasets (usually a few 
tens of thousands of events). Single-station 
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approaches are in an experimental phase 
that has explored the feasibility of maximizing 
the information that can be extracted from 
smaller units of observation (short windows of 
waveforms recorded by a single instrument) 
by using the feature learning and high non- 
linearity of DNNs. The latter approach has 
the potential to enable monitoring of smaller 
events by using limited observations (or sparse 
network coverage) and benefits from much 
larger training datasets (hundreds of thousands 
of single-station observations) available for 
training deeper DNNs. 

Overall, deep-learning applications for source 
characterization are in an earlier stage than 
those for subsurface inversion or automatic 
data processing. They encounter the challenges 
of the unavailability of genuine ground truth 
and the scarcity of large events (earthquakes 
of 7.0 magnitude or greater) in the training 
set. Incorporating the aleatory and epistemic 
uncertainties into the modeling procedure and 
physical constraints through physics-guided 
(720) or physics-informed (127) DNNs could 
be a promising direction to overcome these 
limitations. 


Exploratory data analyses 


Most applications of deep learning in seis- 
mology have been related to automation and 
modeling tasks; however, deep learning 
can be used in discovery mode to find pat- 
terns and insight in seismic data as well. 
Seydoux et al. (2) developed an unsupervised 
framework that combines a deep scattering 
network and a Gaussian mixture model to 
identify patterns in continuous seismic record- 
ings through segment clustering. They used 
this to reveal the repeating precursory seis- 
micity preceding a landslide in Greenland. 
This method has also been used for clustering 
of daily quasi-periodicity and numerous tran- 
sient microevents on continuous seismic data 
recorded on Mars (122). A similar approach 
has been used to cluster urban seismic noise 
in Long Beach to identify their sources (123). 
In this approach, features that represent dis- 
tinct clusters most effectively are extracted 
from input data by using a deep CNN through 
simultaneous optimization of clustering and 
feature-learning tasks. Unsupervised deep 
learning holds great potential for uncovering 
new insights in continuous seismic data. The 
intelligent feature extraction and dimensional 
reduction mechanism of this new group of 
DNNs (9, 124) render them well suited for 
exploratory analyses of high-dimensional seis- 
mic data such as DAS. 


Beyond conventional seismology 


Seismology has many unconventional appli- 
cations. The sensitivity of seismic sensing 
to the Doppler effect makes it an effective 
approach for the monitoring of moving objects 
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such as humans, vehicles, and aircraft (725). 
Deep learning has also been used to detect 
human footsteps (126, 127), discriminate be- 
tween different classes of targets (such as 
humans versus vehicles) (128, 129) and different 
types of vehicles (such as light vehicles, heavy 
vehicles, wheeled vehicles, and tracked ve- 
hicles) (130), or discriminate between differ- 
ent wildlife species and their behavior (137). 

These unconventional applications of seis- 
mic sensing bring challenges and motivate 
technical approaches that could be of interest 
to the larger seismological community. For 
example, Bin et al. (128) showed that through 
on-site feature learning and classification by 
using CNNs to compress data and only trans- 
mitting the classification result to the server, 
it is possible to obtain similar results to those 
of state-of-the-art cloud-based deep-learning 
methods at only 1/10 of the computational 
time. Similar approaches could help reduce 
current challenges in seismic monitoring by 
use of fiber optic cables. Compressed sensing 
of seismic data (132), fusion of multisensor 
data (129), and federated learning (133) are 
other areas in which deep-learning techniques 
could assist in processing large data volumes. 

The Nobel prize-winning detections of gravi- 
tational waves and subsequent groundbreaking 
discoveries made by using the advanced Laser 
Interferometer Gravitational-wave Observatory 
(LIGO) were made possible through a series of 
multidisciplinary research efforts. The advances 
in deep-learning seismology discussed above 
have the potential to contribute to gravity- 
wave measurements in multiple ways. Despite 
sophisticated seismic isolation systems, ground- 
based gravitational wave interferometers re- 
main susceptible to ground shaking from large 
teleseismic earthquakes, which are one of the 
primary sources of interruption of the proper 
functioning of the interferometers and sub- 
stantially reduce their duty cycle (134). Machine 
learning for ground motion characterization 
and EEW could help reduce that impact. Deep 
learning can also be used to predict on the basis 
of archival seismic data the ground-motion 
level and state of the gravitational wave de- 
tector during the event of an earthquake (735). 

The template-matching algorithms used for 
the detection of gravitational waves can be 
complex and computationally expensive. Sim- 
ilar to earthquake signal detection and 
discrimination, deep learning has been shown 
to be an effective alternative for efficient de- 
tection and classification of gravitational-wave 
signals from different cosmic sources (136). 
DNN models can detect and estimate the true 
parameters of real events observed by LIGO 
with comparable sensitivity and lower error 
than that of template-matching, while being 
several orders of magnitude faster and more 
resilient to transient noise artifacts (136). These 
gravitational waves are extremely weak signals 
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embedded in highly non-Gaussian and nonsta- 
tionary noise owing to instrumental artifacts 
and environmental contamination. Denoising 
by use of deep-learning approaches and with 
the supervision of accompanied recorded seis- 
mograms is another approach for the detec- 
tion of new events below the noise floor (737). 


Discussion 
Meta-analysis 


To date, automation of seismic data process- 
ing and solving of inverse problems form the 
majority of deep-learning applications in seis- 
mology (Fig. 3). Although deep learning can be 
used to recognize patterns and extract insights 
from high-dimensional seismic data, their ap- 
plication for exploratory analysis of seismic 
data and scientific findings are largely un- 
explored (Fig. 4A). This might be due to an 
atmosphere of skepticism regarding the black 
box nature of deep-learning models. In most 
seismic data-processing tasks, however, the 
cost of a wrong answer is low relative to the 
value of a correct answer. For example, the risk 
of picking wrong or inaccurate phase arriv- 
als in earthquake monitoring is low because 
wrong or inaccurate arrivals will usually be 
eliminated or refined in subsequent process- 
ing. Thus, having a deep-learning picker that 
can do this task with high efficiency but some 
brittleness, which is defined as the tendency to 
get fooled (138), and imperfect precision does 
not undermine the utility of the method. Even 
in applications such as EEW, for which the cost 
of a wrong answer may be high, deep learning 
may offer the best available approach (76). 
Last, although deep learning cannot be used 
to establish causation or understand pro- 
cesses directly from data, it can be used to 
inspire and direct researchers toward scien- 
tific discoveries. 

The scarcity of large-scale labeled field data 
for model building is more evident in active 
seismology and for applications such as seis- 
mic image processing, interpretation, and 
subsurface characterization (Fig. 3). This 
scarcity is due to both the limited number of 
large-scale and publicly available datasets and 
the subjectivity of labeling for these tasks. 
Earthquake signal processing tasks—such as 
seismic phase picking, phase association, and 
earthquake detection—benefit from larger and 
publicly available training datasets. 

Supervised learning has dominated deep- 
learning approaches used in seismology to date. 
However, important applications for alternative 
unsupervised and semisupervised approaches 
are being increasingly found. Unsupervised 
learning has focused on seismic facies analy- 
ses, seismic denoising, event discrimination, 
and exploratory data analyses. As the num- 
ber of developed models for each seismolog- 
ical task grows, techniques such as ensemble 
learning are likely to find more applications. 
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Fig. 3. Deep-learning applications for seismological tasks. The size of each circle is scaled to the number of published papers for each application and color-coded 
according to the average size of training data. Horizontal axes represent the average percentage of observational and recorded data versus synthetic training data. 
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Fig. 4. Hierarchical distribution of studies in our database. (A and B) Studies are distributed according to (A) the seismological applications and (B) used DNNs. 


Fine tuning of trained models, especially those 
trained with synthetic data, is popular in seis- 
mology, and we expect that meta-learning will 
be a growth area in seismology. 

CNNs dominate seismological models across 
many sectors (Fig. 4B) because of their effec- 
tiveness for automatic feature extraction and 
sparse representation learning of seismic data. 
Some characteristics of seismic data such as 
their limited bandwidth, sequential nature, 
or data structure (for example, 3D and 4D 
imaging in active seismology) have motivated 
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developments of specialized and hybrid DNN 
architectures composed of a mixture of neural- 
layer types. RNNs are popular in tasks in 
which temporal relations in data play a crucial 
role in the modeling. Network architectures 
with contracting encoders and expanding de- 
coders such as U-Net and Autoencoders are 
highly suitable for modeling seismic data across 
many applications. GANs are another popular 
approach for seismic data processing. Among 
RNNs, long short-term memory (LSTM) units 
are the most popular network type. The se- 
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quential nature of seismic data makes RNNs 
appealing for many seismological tasks. More 
recently, neural networks with attention mech- 
anisms, dilated CNNs, and physics-informed 
DNNSs are gaining momentum. 


Explainable deep learning 


Despite the widespread application of deep 
learning in seismology and its breakthrough 
performance in multiple areas, the black-box 
nature remains a source of skepticism. Accept- 
ing indecipherable models on their own terms 
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can contribute to scientific goals of the com- 
munity (139), but presenting the properties 
of these models in understandable terms to 
humans (interpretability) and revealing the 
underlying reasons for their output decision 
(explainability) (740) would allay such skep- 
ticism and lead to wider adoption and new 
insights. 

Currently, most deep-learning approaches 
in seismology concern building a model based 
on known input-output pairs to predict the 
outputs corresponding to previously unseen 
inputs without interpretability or explainabil- 
ity. In most of these cases, it is not possible to 
explain why specific outcomes were obtained 
from an algorithmic point of view nor what 
else these models can tell us. 

Visualizing the learned kernels at differ- 
ent layers of a CNN is the main approach 
used in some of the seismological studies 
(93, 107, 141, 142) to understand something 
about the representations a deep learning 
model has learned and how it drives the out- 
put; however, the extracted multiscale feature 
representations from seismic data are too 
complex for direct visualization to provide 
interpretable and understandable insights. 

Although it may not be possible to elucidate 
how a deep-learning model works, gaining 
useful information about learned models is 
possible by using interpretation tools such as 
backward propagation techniques, saliency 
maps, or heatmaps to reveal relevant patterns 
in the input based on feature importance or 
relevance scores (143, 144). These interpret- 
able representations of the input can be used 
along with domain knowledge to explain the 
output. 

Mousavi et al. (12), for example, used atten- 
tion mechanisms in their detection and picker 
DNN. They used a hierarchy of attention mod- 
ules to gain insight into the interaction of task- 
specific decoder branches and where and on 
what the DNN focused. Doing so gave a degree 
of interpretability to their results. Attention 
was used in (54) to build an interpretable deep- 
learning model for seismic facies analysis. They 
used 3D spatial-spectral attention maps to 
reveal the relations between the seismic spec- 
tral response (input) and the geological forma- 
tions (output). Liao et al. (22) used a backward 
propagation technique to compare different 
deep-learning models for phase-picking on the 
basis of the learned representations from data. 

In addition to explaining the outcomes, 
designing an interpretable deep-learning 
model is possible in which the model itself 
and its components became explainable by 
using domain knowledge (J//, 145). In this 
approach, the network architecture is de- 
signed to provide insight by promoting or 
restricting specific attributes of the data being 
modeled (146, 147). Zhu et al. (87), for ex- 
ample, used a mixture of Gaussian diffusion 
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kernels to build an interpretable model of 
anisotropic spatial dependency in seismic- 
ity patterns. The parameters of these kernels 
are reparameterized by DNNs and learned 
from training data by using an imitation 
learning approach. Such an interpretable 
model can help uncover patterns [and hopefully 
causal relationships (748)] in data. 


Incorporating geophysics into DNNs 


Deep-learning approaches that incorporate 
physical laws have gained momentum in the 
machine learning community (149) and a 
growing number of implementations in seis- 
mology. The main idea is to integrate data and 
mathematical physics (domain knowledge) 
models, even if only partially understood. The 
key objectives are to provide interpretability 
and explainability for a deep-learning method, 
to regularize them in the presence of missing 
or noisy data, and to enhance their out-of- 
distribution generalization by restricting the 
solution space to physically plausible solutions. 

Domain knowledge can be incorporated into 
the training data, the hypothesis space, and the 
training procedure of a deep-learning model 
(Fig. 5). Among these, incorporating the do- 
main knowledge into training data through 
data generation and augmentation by using 
numerical simulations is the most commonly 
used approach in seismology for active seismic 
data processing (47, 150), wave simulation (68), 
and velocity estimation (91). These models are 
usually fine tuned through some examples of 
field data. 

The hypothesis space can be constrained 
with domain knowledge through design of 
specialized neural network architectures based 
on the characteristics of the problem. Such 
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physics-inspired neural network architectures 
have been developed for seismic FWI (11, 145). 
Sun et al. (111) used a recurrent architecture 
to model time-dependent dynamics of seismic 
wave propagation. To enforce physical rules 
for wave propagation in the model, they set 
up LSTM-like cells—including some physical 
operators, which take the wavefield at a past 
instant as input—estimated the shot record at 
the current instant as output, and saved the 
modeled wavefield in the memory cell for cal- 
culating the next time step. In these models, 
the trainable parameters of the DNN can be 
viewed as a reparameterization of an Earth 
model. The forward propagation of informa- 
tion through this representation resembles 
the forward propagation of a seismic wave- 
field through a heterogeneous medium. Thus, 
training such a network and updating its 
weights by using DNN back-propagation and 
based on the misfit between the simulated 
and observed seismic data amounts to back- 
propagation of the residual wavefield and 
under certain assumptions is equivalent to the 
optimization process in conventional gradient- 
based FWI. This can result in a more interpret- 
able model in which the latent representation 
is physically meaningful. 

Physical constraints can be introduced into 
the training process either by incorporating 
them into the loss function (implicit approach), 
by explicit inclusion of a numerical model- 
ing into the training loop, or both. The first 
approach is used more recently in seismology 
and primarily for forward problems such as 
simulation of the pressure wavefield (68) or 
estimation of first arrival times (77). In these 
studies, a form of the wave equation (68) or 
Eikonal equation (77) is used to penalize the 
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Fig. 5. Explainable DNN. The domain knowledge, physics, can be incorporated into the training data through 
(A) data simulation, (B) the hypothesis space by designing specialized neural network architectures, 

and (C) the training procedure of a deep-learning model by enforcing consistency with governing equations. 
The architecture design based on domain knowledge provides some transparency and intepretability of 

the model. The interpretable representation of the input data along with some domain knowledge can be used 


to explain the output. 
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loss functions used for training the DNNs. 
Thus, during the training the model is simul- 
taneously optimized to fit the observed data 
while respecting physical constraints. The ex- 
amples of the second approach are mostly in 
seismic FWI in which a physics-based forward 
modeling of synthetic seismograms is per- 
formed as a part of the training loop to embed 
some physics-based constraints in the model 
(95, 107, 108, 110). In this approach, the mis- 
match between observed and numerically 
simulated seismic data is used to compute 
the gradient of the misfit function with respect 
to the DNN weights and update the initial 
Earth model represented by a DNN. 

Physics-guided (or physics-informed) deep 
learning is effective for solving ill-posed in- 
verse problems as well as for forward problems 
in seismology. They hold a great potential for 
wider applications in seismology beyond ex- 
amples mentioned above. Developing hybrid 
approaches by combining multiple physical 
constraints (for example, simultaneously in 
architecture design and loss function), using 
physics-based GANs to enforce realism for 
data generation (757), constraining a model 
statistically (157), quantifying uncertainties 
(152), and learning physical operators (153) 
are other directions of interest. 


Uncertainty quantification 


As development of deep-learning models for 
some seismological tasks progresses toward 
routine operations, quantification of corre- 
sponding aleatory and epistemic uncertainties 
becomes essential to assess the reliability of 
output predictions. Bayesian DNNs can play a 
key role in this area by providing probabilistic 
treatment of neural networks (154). In these 
models, a prior probability distribution is 
placed over DNN parameters, and a poste- 
rior of these parameters is learned during the 
training. This strategy can be seen as learn- 
ing by an infinite number of neural networks 
given the data in which the inference occurs 
by aggregating the results through ensembling. 
The high time complexity of Bayesian DNNs is 
the main challenge in their practical appli- 
cation; however, advances in scalable and 
easy-to-implement adaption of Bayesian DNNs 
are set to change that (155-158). 

Monte Carlo dropout sampling (158) is a 
popular technique for uncertainty estimation 
because of its flexibility and ease of implemen- 
tation (57, 94, 118, 159). Mousavi and Beroza 
(118), for example, developed a Bayesian CNN 
for earthquake location by using a customized 
loss function to estimate the aleatory uncer- 
tainties directly from data during the training 
process. The epistemic (model) uncertainties 
were estimated through dropout during infer- 
ence. In this approach, the loss function acts 
like an intelligent loss function and reduces 
the effect of uncertain data, whether because 
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of noise or unreliable labels; however, the 
effectiveness of this approach in estimating 
model uncertainties is limited. 

Recent computational advances (160, 161) 
for building deep probabilistic models provide 
alternative options for a more robust estima- 
tion of model uncertainties. Feng et al. (56) 
developed a Bayesian deep-learning frame- 
work for seismic facies analysis in which the 
Bayesian variational inference for approximat- 
ing the posterior distribution of DNN param- 
eters is formulated as an optimization process. 
In this approach, multiple random samples 
are drawn from the estimated posterior dis- 
tribution to quantify the model uncertainties 
by using the prediction entropy. 

Besides uncertainty quantification, the Bayes- 
ian DNN approach has some implicit regulari- 
zation effects that can suppress overfitting, 
especially when there is insufficient data. We 
expect that there will be an increased effort 
to develop probabilistic deep-learning models, 
especially for tasks such as seismic image 
interpretation, in which the cost of a wrong 
decision is high. Developing physics-informed 
Bayesian models would be one approach in 
this direction. 


Training data limitations 


The existence of large-scale realistic training 
datasets with diverse characteristics and pre- 
cise labels is crucial for the success of deep 
learning. Uncertainties and nonuniqueness 
of manual labels owing to limited resolution, 
presence of noise, different levels of expertise, 
cognitive biases, and inherent ambiguity of 
tasks is a limiting factor in the development of 
training data in seismology. Two analysts may 
disagree about the existence or location of a 
particular fault, or a deep horizon in a seismic 
image, or arrival times in an earthquake signal 
when labeling them, which biases training 
data. A challenging task in building training 
datasets is quality control of the labels. 

In addition to labeling uncertainties, some 
tasks such as seismic velocity inversion, image 
processing, and interpretation face the chal- 
lenge of a scarcity of public large-scale labeled 
data from field observations. One approach 
to address this limitation is to generate phys- 
ically realistic synthetic data that represent a 
variety of complex geological settings (162). 
For seismic velocity inversion, Wang et al. (163) 
used smoothed versions of natural images as 
seismic velocity models to produce synthetic 
seismic traces to use as training data. Feng et al. 
(164) developed a real-time style transform 
method to create a large set of physically real- 
istic subsurface velocity maps from natural 
images. They used these data in their multi- 
scale deep learning-based FWI. Araya-Polo et al. 
(165) used GANs to generate a large number of 
geologically feasible models to augment their 
original training data set. 
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Another approach uses inductive transfer 
learning. This was used to fine-tune pre- 
trained CNN models on handwritten images 
(8) or natural images (166) to classify volcano- 
seismic events according to their time-frequency 
representations. Developing semisupervised 
(20, 95, 167) and unsupervised (19, 35, 110) 
approaches for reducing a need for labels is a 
growing trend. Other options are adding con- 
straints to the training procedure (167, 168) 
and reducing the degrees of freedom (the 
number of trainable parameters) in super- 
vised approaches by incorporating physics- 
based constraints. 


Out-of-distribution generalization 


To evaluate the quality of the obtained model 
in most seismological applications, authors 
usually perform a blind test on a subset of the 
data not used in training and/or samples of 
data from other datasets (or regions). A blind- 
test dataset that is representative of the full 
diversity of the real world is needed to eval- 
uate the generalization of a model robustly. 
Statistical significance of such tests is usually 
not presented, and it is unclear to what extent 
a trained network can generalize. 

For applications such as earthquake signal 
detection and phase picking, in which the 
model generalizations have been studied ex- 
tensively (12, 22, 23, 25, 29, 169), neural net- 
work architecture, neural network type, and 
the size of a training set have been shown to 
be not the only defining factors for good out- 
of-distribution generalization (72) and that 
hyperparameter tuning (29), training proce- 
dure, and data augmentation (170) also play 
important roles. These effects have seen less 
attention in seismic image denoising, process- 
ing, and interpretation, in which model build- 
ing is often done by using the synthetic data 
and data augmentation is limited (in part 
because some common image augmentation 
techniques, such as rotation, result in physi- 
cally implausible models). 

Generalization of a deep-learning model 
always has a limit because each model may 
have blind spots, and out-of-distribution cases 
can always arise. This makes it difficult to have 
a distinct and robust measure of the general- 
izability. It remains unclear how much im- 
provement might be obtained by using physics- 
guided DNNs or Bayesian frameworks. More 
systematic studies are required to understand 
and characterize this important property of 
deep-learning models. 


Benchmarking pitfalls and concerning trends 


Despite the recent increase of applications 
of deep learning in seismology, not all the 
reported improvements lead to real-world 
performance increases. Often it is not clear 
what worked and why or which deep-learning 
approaches actually lead to better performance. 
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The choice of weak or often not properly fine- 
tuned baselines, different datasets, data pre- 
processing, evaluation protocols, performance 
metrics, the lack of result reproducibility, 
ablation studies, error analysis, and robust- 
ness checks are all factors that make it difficult 
to assess progress. To overcome this concern- 
ing trend and avoid the “phantom progress” 
phenomena observed in other fields (171), 
encouragement and ultimately enforcement 
of standard benchmarking need to be a 
necessary foundation of empirical machine 
learning research. 

The implementation of benchmarking is 
currently diverse and inconsistent across deep- 
learning seismology. Applications such as sub- 
surface characterization have well-established 
benchmarks, such as the Marmousi (/72) or 
BP2004 (173) models that are widely used; but 
even in this case, performance evaluation can 
be qualitative. Applications such as event 
discrimination and seismic denoising have 
seen a relatively better adaptation of report- 
ing metrics (for example, precision and recall, 
SNR, or mean-squared error) but suffer from 
the lack of canonical benchmarks. Because 
this is still a young field, there is no standard 
benchmark or evaluation setup for most of 
the deep-learning applications in seismology. 
As a result, researchers choose their own test 
data and evaluation criteria to showcase their 
method and (perhaps unknowingly) tend to 
find good performance for their models. This 
lack of community-driven sets of benchmarks 
and evaluation guidelines hinders progress in 
the field. 

Given the indispensable role of benchmark- 
ing for measuring progress, the community 
involved in deep-learning applications needs 
to set up more unified, standard, and rigorous 
evaluations that will ultimately lead to the 
development of insight and understanding 
because the community also are those who 
will endorse and enforce these practices. This 
requires first determining the guidelines for 
the reproducibility of the results, the choice 
of the baselines, building of benchmark 
datasets, and defining of task-specific metrics 
and standards for reporting the results. When 
creating a benchmark dataset for machine 
learning applications, documentation of its 
composition, collection process, labeling un- 
certainties, and applicable tasks, among other 
criteria, are all important. Only a few such 
benchmarks currently exist in seismology 
(174-177) specifically designed for machine 
learning. Evaluating a model on multiple 
benchmarks clearly would provide a more 
robust performance measurement. In this 
spirit, setting up guidelines on how to use 
benchmarks is also important. Clear report- 
ing of the training data and its size, model 
size, time and memory complexities, inference 
latency, suitable metrics indicating model 
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variance, its generalization to the out-of- 
distribution data, amount of hyperparameter 
tuning, and a discussion of statistical signif- 
icance of the results are all important factors 
that have received too little attention in the 
seismological community. Setting up clear 
guidelines for reviewers is the next step that 
could help promote and enforce greater rigor 
in the field and render more accurate and fair 
judgments when comparing models. 


Conclusions 


Deep learning has entered almost every sub- 
field of seismology and has the potential to 
dominate algorithm development. Yet as a 
general-purpose tool for scientific discovery, 
deep learning is in its infancy. Investing more 
effort on physics-based and explainable deep 
learning will help the field to mature. Research 
is rapidly moving from proof-of-concept efforts 
to real-world applications, and estimation of 
uncertainty will become essential. Given com- 
mon pitfalls and concerning trends, setting 
up standard benchmarking and evaluation 
protocols is critical. An understanding of the 
limitations and breaking points of these mod- 
els will be extremely helpful. Despite these 
challenges, deep-learning approaches will 
clearly contribute substantively to seismol- 
ogy by optimizing results, providing value, and 
sparking inspiration. Given the incredible 
ability of DNNs for modeling and processing 
large-volume, high-dimensional seismic data 
and their remarkable performance in varied 
seismological tasks, deep learning will soon 
become the norm in multiple seismic data 
analyses, notwithstanding issues of interpret- 
ability and brittleness. Deep learning will not 
replace expert seismological analysts, but those 
analysts who use it will outperform those who 
do not for many types of problems. 
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INTRODUCTION: Many organisms have evolved 
specialized immune pattern-recognition re- 
ceptors, including nucleotide-binding oligo- 
merization domain-like receptors (NLRs) of 
the STAND superfamily that are ubiquitous 
in plants, animals, and fungi. NLRs oligomerize 
upon recognition of pathogen-associated mo- 
lecular patterns, leading to the activation of 
an effector domain that mediates an inflam- 
matory or cell death response. Although the 
roles of NLRs in eukaryotic immunity are well 
established, it is unknown whether prokary- 
otes use similar defense mechanisms. 


RATIONALE: We previously identified a set of 
bacterial and archaeal STAND nucleoside tri- 
phosphatases (NTPases), dubbed Avs (antiviral 
STAND), that protect bacteria from tailed 
phages through an unknown mechanism. Like 
eukaryotic NLRs, Avs proteins have a charac- 
teristic tripartite domain architecture consist- 
ing of acentral NTPase, an extended C-terminal 
sensor, and an N-terminal effector. Here, we 
investigate the defense mechanism of these 
Avs proteins. 


RESULTS: Using genetic screens in Escherichia 
coli, we characterized four Avs families (Avs1 to 


Avs4) and found that they detect hallmark viral 
proteins that are expressed during infection. 
In particular, Avs1 to Avs3 recognize the large 
terminase subunit, and Avs4 recognizes the 
portal. These two proteins together make up the 
conserved DNA packaging machinery of tailed 
phages. Coexpression of an Avs protein with its 
cognate target in E. coli resulted in cell death. 

We assessed the specificity of Avs target 
recognition with a panel of terminases and 
portals from 24 tailed phages, spanning nine 
major families. Notably, a single Avs protein 
was capable of recognizing a large variety of 
targets (terminase or portal), with less than 5% 
sequence identity in some cases. 

We next reconstituted Avs activity in vitro, 
focusing on representatives from Salmonella 
enterica (SeAvs3) and E. coli (EcAvs4), both of 
which contain N-terminal PD-DExK nuclease 
effectors. In the presence of their cognate tar- 
get, SeAvs3 and EcAvs4 mediated degradation 
of double-stranded DNA. Nuclease activity re- 
quired the presence of Mg”* and adenosine tri- 
phosphate (ATP); however, the hydrolysis of 
ATP was not strictly required. Single-stranded 
DNA and RNA substrates were not cleaved. 

We determined the cryo-electron microscopy 
structures of the SeAvs3-terminase and EcAvs4- 
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portal complexes, revealing that both form tet- 
ramers in which the C-terminal sensor domain 
of each Avs subunit binds to a single target 
protein. Binding is mediated by shape comple- 
mentarity across an extended interface, consis- 
tent with fold recognition. Additionally, SeAvs3 
directly recognizes terminase active-site residues 
and its ATP ligand. Tetramerization of both 
SeAvs3 and EcAvs4 is mediated by their STAND 
ATPase domains and allows the N-terminal nu- 
cleases to adopt active dimeric configurations. 

Bioinformatic analysis of Avs proteins across 
prokaryotic lineages revealed at least 18 distinct 
types of N-terminal effectors that are modularly 
swapped between Avs homologs, as well as 
widespread distribution of avs genes across 
phyla with extensive horizontal gene transfer. 
Finally, we also discovered phage-encoded Avs 
inhibitors, highlighting an extensive arms race 
between prokaryotes and viruses. 


CONCLUSION: NLR-like defense proteins in 
bacteria and archaea recognize the conserved 
folds of hallmark viral proteins and assemble 
into tetramers that activate diverse N-terminal 
effectors. The mechanism of these proteins 
highlights the similarity between the defense 
strategies of prokaryotes and eukaryotes and 
extends the paradigm of pattern recogni- 
tion of pathogen-specific proteins across all 
three domains of life. 
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Mechanism and structures of NLR-like defense proteins in prokaryotes. (Left) Comparison of the domain architectures of 11 representative NLR-like pattern- 
recognition receptors across four kingdoms of life. Selected structures of activated complexes are shown as examples. T3SS, type 3 secretion system. (Right) Defense 
mechanism of Avs proteins in bacteria and archaea (this study). Target binding triggers the formation of Avs tetramers, which activates an N-terminal effector 


that disrupts the viral life cycle. 
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Many organisms have evolved specialized immune pattern-recognition receptors, including nucleotide- 
binding oligomerization domain-like receptors (NLRs) of the STAND superfamily that are ubiquitous 

in plants, animals, and fungi. Although the roles of NLRs in eukaryotic immunity are well established, 
it is unknown whether prokaryotes use similar defense mechanisms. Here, we show that antiviral STAND 
(Avs) homologs in bacteria and archaea detect hallmark viral proteins, triggering Avs tetramerization 
and the activation of diverse N-terminal effector domains, including DNA endonucleases, to abrogate 
infection. Cryo-electron microscopy reveals that Avs sensor domains recognize conserved folds, active- 
site residues, and enzyme ligands, allowing a single Avs receptor to detect a wide variety of viruses. 
These findings extend the paradigm of pattern recognition of pathogen-specific proteins across all 


three domains of life. 


acteria and archaea have evolved nu- 

merous defense mechanisms against 

viral infections that involve a wide range 

of strategies and enzymatic activities 

(1-5). Defense systems are activated by 
viral nucleic acids, in the case of restriction- 
modification and CRISPR-Cas systems, or by 
different types of infection-induced cellular 
stress, including DNA double-strand breaks (6), 
inhibition of host transcription (7), cytosolic 
nucleotide depletion (8), and the disruption 
of translation elongation factor EF-Tu (9) or 
RecBCD repair nuclease (10). Alternatively, 
some systems constitutively synthesize small 
molecules that interfere with phage replica- 
tion (11, 12). However, for numerous defense 
systems, the mechanisms of activation remain 
uncharacterized, and it appears likely that 
distinct modes of activation exist within the 
diverse repertoire of recently discovered sys- 
tems (/, 3, 4, 13). 

STAND nucleoside triphosphatases (NTPases), 
which include nucleotide-binding oligomer- 
ization domain-like receptors (NLRs) such as 
animal inflammasomes and plant resistosomes, 
are among the key players in immunity, cell 
signaling, and particularly programmed cell 
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death in eukaryotes (14-21). STAND NTPases 
have a conserved tripartite domain architecture, 
consisting of a central NTPase domain, a 
C-terminal sensor that contains superstructure- 
forming repeats, and, in many cases, an N- 
terminal effector that mediates inflammation 
or cell death. In animal and plant innate im- 
munity, STAND proteins function by recog- 
nizing diverse pathogen-associated molecular 
patterns (PAMPs), including peptidoglycan 
fragments from the bacterial cell wall (18), 
double-stranded RNA (27), bacterial flagel- 
lin and type 3 secretion systems (16, 17), and 
endogenous host proteins that have been 
modified by pathogens (19, 20). In all of 
these cases, recognition of the PAMP leads 
to oligomerization of the STAND NTPase 
and activation or recruitment of effector 
proteins. 

Bacteria and archaea, especially those with 
complex signaling systems, also encode a di- 
verse repertoire of STAND NTPases that are 
predicted to be involved in signal transduction 
and possibly in programmed cell death (14, 15). 
However, the functions of these proteins are 
largely unknown, with the exception of several 
that have been characterized as transcriptional 
regulators (22-24). We recently identified a 
group of prokaryotic STAND NTPases, dubbed 
Avs (antiviral STAND) (4), that are often en- 
coded next to restriction-modification and 
other defense systems (fig. S1) and protect 
bacteria from tailed phages (fig. S2). Here, 
we investigate the mechanism of Avs proteins. 


Avs systems are activated by two conserved 
phage proteins 

Although the domain architectures of Avs 
proteins resemble those of eukaryotic NLRs 
(Fig. 1, A and B), it is unclear whether they 
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function through similar molecular mecha- 
nisms. We identified four distinct families of 
Avs proteins (Avs1 to Avs4) (fig. S3), each of 
which contains highly divergent tetratrico- 
peptide repeat (TPR)-like sensor domains, and 
selected two representatives for further char- 
acterization: SeAvs3 from Salmonella enterica 
NCTC13175 and EcAvs4 from Escherichia coli 
NCTC11132, both of which provide robust pro- 
tection against the T7-like coliphage PhiV-1 
(fig. $2). We first asked how phage infection 
leads to Avs activation and whether a specific 
phage-encoded trigger exists for these defense 
systems. We cloned fragments covering the 
whole PhiV-1 phage genome into expression 
plasmids and transformed the resulting frag- 
ment library into E. coli containing either Avs 
proteins or empty-vector controls (Fig. 1C and 
data S1). We hypothesized that coexpression of 
Avs proteins with their putative triggers might 
lead to cell death and depletion of the re- 
spective phage genes from the pool, and we 
performed deep sequencing to detect enrich- 
ment or depletion of phage genes. Four phage 
genes were generally toxic to all cells; how- 
ever, two genes were depleted only in the 
presence of Avs proteins, namely the large 
terminase subunit (gp19) when coexpressed 
with SeAvs3 and the portal protein (gp8) 
when expressed with EcAvs4 (Fig. 1D and 
fig. S4). By Southern blot, we observed that 
Avs3- and Avs4-mediated depletion of phage 
DNA during infection was abolished in gp19 
and gp8 knockout phage strains, respectively 
(Fig. 1E and fig. S5), indicating that gp19 and 
gp8 are both necessary and sufficient for Avs 
activation. 

To validate these findings, we transformed 
plasmids expressing gpl19 or gp8 into E. coli 
harboring SeAvs3 or EcAvs4 and measured 
cell viability. Consistent with our previous 
results, we observed cell death after coexpres- 
sion of SeAvs3 and gp19, as well as coexpression 
of EcAvs4 and gp8, but not with the reciprocal 
pairs (Fig. 1E). This toxicity depended on the 
predicted nuclease activity of both SeAvs3 
and EcAvs4 and, importantly, was not due to 
any intrinsic features of the natural phage 
gene sequence, because recoded gene sequences 
also led to cell death (Fig. 1F). Furthermore, the 
enzymatic activity of the phage terminase, 
which contains adenosine triphosphatase 
(ATPase) and nuclease domains unrelated to 
those of Avs proteins, was not required for 
SeAvs3-mediated toxicity (Fig. 1F). 


Avs proteins recognize a diverse range of 
terminase and portal proteins 


To investigate the specificity of Avs activation, 
we cloned the portal and large terminase sub- 
unit genes from 24 tailed phages, spanning 
nine major phage families, and coexpressed 
these genes in FE. coli with 15 Avs systems 
spanning all four Avs families (data S2 and S3). 
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Fig. 1. Prokaryotic STAND NTPases recognize phage terminase and portal 
proteins. (A) Maximum likelihood tree of the ATPase domain of selected 
NLR-like STAND NTPases in four model organisms across kingdoms of life. 

(B) Domain architectures of representative NLR-like genes in (A). Ankyrin, ankyrin 
repeat; BIR, baculoviral inhibitor of apoptosis repeat; CARD, caspase activation 
and recruitment domain; FIIND, function to find domain; LRR, leucine-rich repeat; 
PLP, patatin-like phospholipase; PYD, pyrin domain; RX-CC, potato virus X 


We quantified cellular toxicity and depletion of 
specific Avs-phage protein pairs for all 720 
combinations by deep sequencing (Fig. 2A and 
data S2). These experiments revealed precise 
target specificity: Avs1 to Avs3 recognized only 
large terminase subunits, whereas Avs4 recog- 
nized only the portal protein (Fig. 2B and 
fig. S6). To assess the robustness of the assay, 
we repeated these experiments by varying the 
Avs promoter or the amount of terminase 
and portal induction and obtained similar 
results (fig. $7). Surprisingly, Avs] and Avs2 
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also recognized terminases despite the lack of 
substantial sequence similarity among the 
C-terminal TPRs of Avs1 to Avs3, although 
we detected a structurally similar B sheet- 
rich domain at the end of the TPR arrays in 
all three proteins (fig. S8). These findings 
demonstrate conservation of target recogni- 
tion across Avs families and suggest that the 
portal and large terminase subunit are key 
PAMPs that are recognized by prokaryotes. 
Moreover, Avs systems recognize PAMPs from 
diverse phages; for example, SeAvs3 and EcAvs2 
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resistance protein coiled-coil domain; TPR, tetratricopeptide repeat; WD40, WD40 
repeat. (C) Schematic of the genetic screening approach used to identify phage- 
encoded activators of Avs proteins that induce cell death. Amp®, ampicillin 
resistance gene. (D) Genetic screen results for phage-encoded activators. 

(E) Quantification of the phage DNA band intensity in a Southern blot of DNA 
isolated from phage-infected E. coli. WT, wild type. (F) Photographs of E. coli 
cotransformation assays with avs genes and phage activators identified in (D). 


were strongly activated by 20 of 24 and 19 of 
24 tested terminases, respectively, and EcAvs4 
was strongly activated by 15 of 24 tested portals 
(>100-fold depletion) (Fig. 1F). Because the 
portals and terminases from different phage 
families have limited sequence similarity, with 
less than 5% pairwise sequence identity in some 
cases (Fig. 2C), but share the same core fold 
(figs. S9 to S11), this broad range of activity 
implies that Avs proteins are triggered by 
conserved structural features rather than by 
specific peptide sequences. Consistent with 
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Fig. 2. Avs proteins are pattern-recognition receptors for the terminase 
and portal of diverse tailed phages. (A) Schematic of the plasmid depletion 
assay. (B) Heatmaps of plasmid depletion for the terminase and portal proteins 
of representative phages spanning nine major tailed phage families. The native 
avs promoter was retained for all homologs except for those outside of the 
Enterobacteriaceae family (EpAvsl and CcAvs4). Terminases and portals were 
induced with 0.002% arabinose. Horizontal black lines at the top of the panel 


Gao et al., Science 377, eabm4096 (2022) 12 August 2022 


indicate groups of terminase proteins with at least 20% pairwise sequence 
identity. Asterisks indicate prophages. aa, amino acids; CTD, C-terminal domain; 
D. archaeon, Desulfurococcales archaeon ex4484_217_2; E. coli-1, E. coli 
NCTC9020; E. coli-2, E. coli M885; S. flava, Sphingopyxis flava R11H. (C) Pairwise 
amino-acid sequence identity between the core folds of the terminases and 
portals in (B), excluding nonconserved regions. (D) Activity of four Avs proteins 
against the human herpesvirus 8 (HHV-8) terminase and portal. 
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this possibility, EcAvs2 and EcAvs4 displayed 
weak but clear recognition of the terminase 
and portal, respectively, of human herpesvirus 8 
(Fig. 2D), which is a highly diverged evolution- 
ary derivative of tailed phages (25) and does not 
infect prokaryotes. 


SeAvs3 and EcAvs4 are phage-activated 
DNA endonucleases 


SeAvs3 and EcAvs4 contain predicted N- 
terminal PD-DExK-family nuclease domains 
(Fig. 3A), which we hypothesized degrade 
phage and cellular DNA upon target recog- 
nition. The nuclease domain of SeAvs3 is most 
similar to Cap4 effector nucleases of cyclic 
oligonucleotide-based defense systems (26, 27), 


whereas EcAvs4 has an Mrr-like restriction 
endonuclease (28). Both Avs proteins contain 
conserved D-QxK or E-QxK catalytic motifs 
(Fig. 3B), and, in addition to the STAND NTPase, 
the SeAvs3 system contains a small open read- 
ing frame (ORF), the deletion of which reduced 
antiphage activity in E. coli (fig. S12A). 

To biochemically reconstitute Avs activity 
in vitro, we purified recombinant SeAvs3, the 
protein encoded by the small ORF, EcAvs4, 
and the PhiV-1 terminase (gp19) and portal 
(gp8) proteins (fig. S13A). We incubated 
SeAvs3 and the small ORF product with linear 
double-stranded DNA (dsDNA) and observed 
progressive degradation of the substrate in 
the presence of gp19 but not gp8 (Fig. 3, C and 


D). This nuclease activity was dependent on 
the catalytic residues of SeAvs3 but did not 
require the small ORF product (Fig. 3C). We 
further investigated the substrate specificity 
of Avs systems and found that the nuclease 
activity was specific for dsDNA, whereas single- 
stranded DNA and RNA were not cleaved (fig. 
S13B). Moreover, SeAvs3 cleaved both linear 
and circular dsDNA, including EF. coli genomic 
DNA (fig. S13C), indicative of endonuclease 
activity with no specificity for phage DNA, 
which is consistent with an abortive infec- 
tion defense mechanism. 

We next investigated cofactor requirements 
of SeAvs3 and found that in vitro activity de- 
pends on both Mg” and adenosine triphosphate 
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Fig. 3. SeAvs3 and EcAvs4 are phage-activated DNA endonucleases. Q, Gln; R, Arg; S, Ser; T, Thr; V, Val; W, Trp; and Y, Tyr. (C to E) Agarose gel 


(A) Domain architecture of SeAvs3 and EcAvs4. (B) Alignment of Avs D-QxK 
nuclease motifs with characterized Cap4 and Mrr representatives. Single-letter 
abbreviations for the amino acid residues are as follows: A, Ala; C, Cys; D, Asp; 
E, Glu; F, Phe; G, Gly; H, His; |, lle; K, Lys; L, Leu; M, Met; N, Asn; P, Pro; 
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analysis of SeAvs3 nuclease activity in vitro with a linear dsDNA substrate 
{(C) and (D)] and cofactor requirements (E). (F to H) Agarose gel analysis of 
EcAvs4 nuclease activity in vitro with a linear dsDNA substrate [(F) and (G)] 
and cofactor requirements (H). 
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(ATP); however, ATP hydrolysis is not strictly 
required because nuclease activity was ob- 
served at a reduced level in the presence of 
the nonhydrolyzable ATP analog adenylyl- 
imidodiphosphate (AMP-PNP) (Fig. 3E). We also 
found that the nuclease activity of EcAvs4 is 
activated by gp8, but not by gp19, and was abol- 
ished in an EcAvs4 Q63A/K65A (GIn®?—Ala/ 
Lys®—Ala) nuclease mutant (Fig. 3F). Similar 
to SeAvs3, nuclease activity of EcAvs4 required 
the presence, but not the hydrolysis, of ATP 
(Fig. 3H), consistent with phage plaque assays 
of SeAvs3 and EcAvs4 ATPase active-site mu- 
tants (fig. S12B). Together, these experiments 
indicate that SeAvs3 and EcAvs4 are promis- 
cuous DNA endonucleases that are activated 
by distinct, highly conserved phage proteins 
in an ATP-dependent manner. 


Structural basis of Avs binding and 
target recognition 


To investigate how Avs systems recognize and 
bind their cognate phage proteins, we solved 
cryo-electron microscopy (cryo-EM) structures 
of the SeAvs3-terminase and EcAvs4-portal 
complexes in the presence of ATP and Mg** 
(figs. S14 to S17 and table S1). A reconstruc- 
tion at 3.4-A resolution revealed that SeAvs3 
forms a tetramer, with each C-terminal TPR 
domain gripping the ATPase and nuclease do- 
mains of the gp19 terminase (Fig. 4, A and B). 
These TPR lobes are flexible and required 
symmetry expansion to improve their local 
resolution to 3.4 A (Materials and methods 
and figs. S15 to $17). For EcAvs4 bound to the 
PhiV-1 gp8 portal (Fig. 4, C and D), image pro- 
cessing revealed equal abundances of a tetra- 
meric complex and an octameric complex 
corresponding to tetramer head-to-head di- 
merization (fig. $14). At lower protein con- 
centrations, however, we observed only the 
tetramer, indicating that it is most likely the 
functional complex (fig. $14). Negative-stain 
and cryo-EM imaging of SeAvs3 and EcAvs4 
in the absence of phage proteins revealed 
only smaller monomeric particles (fig. S14), 
indicating that phage protein binding is re- 
quired for the assembly of SeAvs3 and EcAvs4 
into tetramers. 

Tetramerization of both SeAvs3 and EcAvs4 
is mediated through their STAND ATPase 
domains, which interact in a manner dis- 
tinct from each other and from other char- 
acterized STAND ATPase oligomers like the 
RoqI resistosome tetramer (29) or the Apafl 
apoptosome heptamer (30) (fig. S18). The 
SeAvs3 ATPase domain forms a C4-symmetric 
tetramer by interactions between the nucleotide- 
binding (NBD) and winged-helix (WHD) sub- 
domains and the NBD subdomain of the 
adjacent protomer, whereas the EcAvs4 ATPase 
domain forms a C2-symmetric dimer of dimers 
with a tighter interface (1232 A? of buried sur- 
face area, compared to 436 A? for SeAvs3), 
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with adjacent WHDs and NBDs both interact- 
ing (fig. S18). The smaller interface in SeAvs3 
is compensated for by additional contacts be- 
tween its C-terminal TPR domains (Fig. 4, A 
and B). SeAvs3 and EcAvs4 both maintain ATP 
in their active sites with an adjacent magnesium 
ion coordinated by the canonical Walker A and 
B motifs (Fig. 4, E and F). Notably, in both cases, 
tetramerization of the STAND ATPase domains 
brings adjacent N-terminal nuclease domains 
close together, forming two nuclease dimers 
with overall C2 symmetry (Fig. 4, G and L). 
SeAvs3 and EcAvs4 contain nuclease effec- 
tors of the PD-DExK superfamily. Conven- 
tional PD-DExK nucleases (e.g., restriction 
endonucleases) use a pair of acidic residues 
to coordinate at least one metal ion anda 
conserved lysine residue to bind the scissile 
phosphate and stabilize the transition state 
for nucleolytic cleavage (Fig. 4J) (31, 32). In the 
SeAvs3 Cap4 tetramer, this arrangement of 
residues is found in all four protomers; how- 
ever, in the two “outward-facing” protomers, 
an extended B strand makes a steric block for 
DNA binding and/or metal coordination (Fig. 
41). In the two “inward-facing” protomers, a 
homotypic interface between the N-terminal 
a helices prevents the formation of this B strand, 
enabling the catalytic site to adopt the active 
configuration (Fig. 4H). Based on the crystal 
structure of the Hind III restriction endo- 
nuclease (33), the inward-facing protomers 
can be predicted to form a cavity for DNA 
binding, with each protomer likely cleaving 
opposite strands of the DNA (Fig. 4K). The 
EcAvs4 Mrr tetramer shows a similar princi- 
ple, whereby the two inward-facing protomers 
contain active sites that resemble canonical 
PD-DEXxK nucleases, but in the outward-facing 
protomers, Glu*’, which is part of the conserved 
trio of active-site residues, is displaced (Fig. 4, 
Mand N). Glu” is found in the loop that spans 
residues 33 to 52, and interactions between 
this loop on an “inward” protomer and an ad- 
jacent “outward” protomer likely stabilize its 
position in the inward protomer. Like the 
SeAvs3 Cap4 tetramer, these two inward 
protomers form a cavity that accommodates 
DNA in a manner similar to Hind III (Fig. 4K). 
SeAvs3 and EcAvs4 both contain extensive 
TPR domains for binding their cognate phage 
proteins, which we confirmed using a bacterial 
two-hybrid system and protein copurification 
(figs. S19 and $20). The SeAvs3 TPR domain 
forms a left-hand-like structure capped by a 
6 sheet-rich C-terminal domain (Fig. 5A). This 
domain has two cavities in which the termi- 
nase ATPase and nuclease domains are nestled. 
Consistent with the ability of SeAvs3 to bind 
terminases with little sequence similarity 
(Fig. 2C), there are few specific residue- 
residue pair contacts between SeAvs3 and the 
PhiV-1 terminase. Instead, binding is deter- 
mined by shape and charge complementarity 
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between the two proteins, burying more than 
3700 A? of solvent-accessible surface area. This 
complementarity is maintained across a diverse 
range of experimental structures and AlphaFold 
models of phage terminases (Fig. 5B). Addi- 
tionally, SeAvs3 directly recognizes residues 
within the two terminase active sites. In par- 
ticular, Asp’”° in SeAvs3 forms a salt bridge 
with the highly conserved Arg® within the 
Walker A motif of the terminase ATPase (Fig. 5, 
Cand E). An arginine in this position is found 
in most terminase ATPases that activate SeAvs3 
but not in nonactivating terminases (Fig. 5E 
and fig. S21). These observations suggest that 
Are”! within the Walker A motif is a determi- 
nant of recognition specificity, and indeed, 
mutation of the cognate arginine in the T4 
terminase ATPase domain substantially re- 
duced SeAvs3 activation by the ATPase domain 
(Fig. 5F). Notably, an arginine is not typically 
found in this position in endogenous cellular 
ATPases (34), suggesting a possible mechanism 
for avoiding off-target activation. Furthermore, 
Arg" 96 and Lys"'°* in SeAvs3 form salt bridges 
to the four conserved aspartates that make 
up the active site of the terminase nuclease 
(Fig. 5, D and E), and mutation of Asp” in the 
PhiV-1 terminase nuclease notably reduced 
SeAvs3 activation by the nuclease domain (Fig. 
5F). Thus, SeAvs3 directly reads the active-site 
residues of both domains of the terminase. 
Furthermore, the ATP ligand bound by the 
terminase is detected by interactions between 
the gamma phosphate and His’ and Tyr!” 
of SeAvs3 (Fig. 5C). Targeting this ligand 
presumably helps avoid phage escape muta- 
tions, because ATP binding is required for the 
function of the terminase. 

Because SeAvs3 detects both domains of the 
terminase, we hypothesized that there might 
be some functional redundancy in these inter- 
actions. Indeed, SeAvs3 was activated by the 
nuclease domain alone from some phages, 
including T7, but was also activated by the 
ATPase domain alone from T4 and ZL19, a 
T1 family phage (fig. S22). Likewise, SeAvs1 
was activated by the nuclease domain from 
T7, but in the case of ST32, both the nuclease 
and ATPase domains were required. These 
results suggest that Avsl and Avs3 recognize 
both the nuclease and ATPase domains but 
differ in the extent of activation by either do- 
main, depending on the terminase. By con- 
trast, deletion of the nuclease domain had no 
impact on Avs2 activity for any of the five tested 
terminases, suggesting that Avs2 recognizes 
the ATPase domain only (fig. S22). This pat- 
tern of recognition is consistent with the larger 
size of Avs1 and Avs3 compared with Avs2. 

The TPR domain of EcAvs4 also binds the 
PhiV-1 portal with a large interface, burying 
5800 A? of solvent-accessible surface area, that 
includes notably few residue-residue contacts 
(Fig. 5G). The portal protein is recognized 
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Fig. 4. Cryo-EM structures A & & Cc 
of SeAvs3 and EcAvs4 SeAvs3 ATPase TPR >» 
in complex with their > Sh sv 

we os 


cognate triggers. (A and 
B) Structure of the SeAvs3- 
terminase complex. 

(C and D) Structure of 

the EcAvs4-portal complex. 
(E and F) ATP molecule 

in the STAND ATPase active 
site of EcAvs4 and SeAvs3. 
The cryo-EM density is 
shown as a transparent 
surface. (G) SeAvs3 
Cap4-like nuclease effector 
domain. (H and 1) Active 
sites for the inward- and 
outward-facing protomers 
of the SeAvs3 Cap4-like 
nuclease. (J) Equivalent view 
of the active site of Hind Ill B 
bound to target DNA with 

two divalent metal ions 

[Protein Data Bank (PDB) 

ID 3A4K]. (K) Electrostatic 
surface potential for the 

SeAvs3 Cap4-like nuclease 

and the EcAvs4 Mrr-like 

nuclease. Active sites are 
indicated by purple circles. 

Ideal B-form DNA is 

modeled on both surfaces 

based on the crystal 

structure of Hind III bound 

to its target (PDB ID G 
3A4K). (L) EcAvs4 Mrr-like 
nuclease effector domain. 
(M and N) Active sites 

for the inward- and outward- 
facing protomers of the 
EcAvs4 Mrr-like nuclease. 


Cap4 nuclease 
Inward-facing active site 


through its stem, clip, and part of its wing 
domain. In an assembled dodecameric portal 
complex, these regions are found toward the 
interior and are therefore more constrained in 
their fold requirements (Fig. 51). Consistent 
with this observation, we performed random 
mutagenesis by polymerase chain reaction 
(PCR) to screen for portal mutations that 
abrogate Avs4 activation and found that all 
29 identified mutations were nonconservative 
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and located in the core wing or stem regions 
(fig. S23), possibly disrupting the core portal 
fold. The clip domain, which contains a con- 
served antiparallel 8 sheet with an intervening 
a helix, is recognized by B-sheet augmenta- 
tion with a hairpin of EcAvs4 (Fig. 5H), a mode 
of fold recognition that does not depend on 
the amino acid sequence of the target. Be- 
cause portal proteins are not enzymes, there 
are no active-site residues to target as in the 
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SeAvs3-terminase complex. Finally, portal oligo- 
merization is not compatible with the Avs4- 
bound state (Fig. 5H), suggesting that Avs4 
recognizes portal monomers before they assem- 
ble into the procapsid. 


Avs proteins are widespread and possess 
diverse, modular effector domains 


To assess the diversity of avs genes in pro- 
karyotes, we collected all intact homologs 
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Fig. 5. Structural basis for viral-fold recognition by SeAvs3 and EcAvs4. 
(A) The interface between SeAvs3 and the PhiV-1 terminase. An SeAvs3 surface 
view is shown in transparency. SeAvs3 is colored from the N to C terminus 
according to the key. (B) AlphaFold or crystal structures of different terminases 
modeled into SeAvs3. The ATPase and nuclease domains were individually 
aligned to the PhiV-1 terminase domains. (C and D) Recognition of the 

PhiV-1 terminase ATPase and nuclease active sites by the SeAvs3 TPR domain. 
(E) Sequence logos for terminase ATPase Walker A motifs and terminase 
nuclease active sites. A total of 11,000 terminase sequences were clustered at 
30% sequence identity, and motifs were extracted from clusters containing 
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terminases targeted or not targeted by SeAvs3 according to Fig. 2B (see also 
fig. S21). (F) Plasmid depletion assay for SeAvs3 coexpressed in E. coli with 

a terminase ATPase or nuclease domain harboring active-site mutations. (G) The 
interface between EcAvs4 and the PhiV-1 portal. An EcAvs4 surface view is 
shown in transparency. EcAvs4 is colored from the N to C terminus according to 
the key. (H) B-sheet augmentation between EcAvs4 and the portal clip domain. 
(1) Comparison of the EcAvs4-bound state of the PhiV-1 portal, the cryo-EM 
structure of the highly homologous T7 portal in its native virion, and AlphaFold 
models of diverse portals. A top view of the assembled dodecamer of the T7 
portal is also shown. 


from each of the four families present in the 
National Center for Biotechnology Informa- 
tion (NCBI) nonredundant protein sequence 
database (data S4 to S8). The avs genes were 
identified in ~4 to 5% of sequenced prokaryotic 
genomes and are broadly distributed (Fig. 6A), 
with at least one avs gene detected in 27 of 


29 and 3 of 10 well-represented bacterial and 
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archaeal phyla, respectively (Fig. 6B and fig. 
$24A). Each Avs family has a characteristic pro- 
tein size (Fig. 6C). We next constructed phylo- 
genetic trees of each of the four families (Fig. 6, 
D and §£; and fig. $24, B and C) and found that 
these trees did not follow bacterial and archaeal 
phylogenies, suggesting extensive horizontal 
gene transfer, particularly for avs2 and avs4, in 
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agreement with previous analyses of STAND 
NTPases (/4, 15). Furthermore, we detected at 
least 18 distinct types of N-terminal effector 
domains present in Avs proteins, including 
non-nuclease domains such as proteases, nucle- 
osidases, sirtuins (SIR2), Toll/interleukin-1 
receptor homology (TIR) domains, cytidine mo- 
nophosphate (CMP) hydrolases, transmembrane 
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Fig. 6. Taxonomic distribution and A 
domain architectures of Avs families. 
(A) Distribution of avs genes across 
phyla. The values above the bars 
indicate the number and percentage 

of genomes containing each gene. 

PVC, Planctomycetota, Verrucomicrobiota, 
and Chlamydiota. (B) Number of 
bacterial and archaeal phyla (minimum 
100 sequenced isolates) with at least 
one detected instance of an avs 

gene. (C) Kernel density plots of the 
ength distribution of Avs proteins, 
excluding the N-terminal domain. The 
ed lines indicate medians. ****p < 
0.0001 (Mann-Whitney). (D and E) Max- 
imum likelihood tree of representatives 
of the ATPase + C-terminal domain 

of Avs2 (n = 1255) (D) and Avs4 

(n = 1089) (E) clustered at 95% 
sequence identity. See fig. S24 for the 
trees for Avsl and Avs3. Stars on the 
outer ring indicate homologs investigated 
experimentally in this study. HTH, 
helix-turn-helix; MBL, metallo-B- 
actamase; REase, restriction endo- 
nuclease. (F) Phage plaque assays 
showing antiphage defense activity of a 
chimeric Avs4 with transmembrane 
N-terminal helices from Sulfurospirillum 
sp. replacing the Mrr-like nuclease 
domain of EcAvs4. The X indicates a 
nuclease domain mutation. 


PhiV-1 


ZL19 * be oe! 


helices, and domains with unknown functions 
(data S4 to S8). Some less common variants 
are predicted to participate in intracellular sig- 
naling networks through effector-associated 
domains (EADs) that may recruit a caspase-like 
protease by protein-protein interaction (35, 36) 
(fig. S25), reminiscent of animal NLRs. 

The apparent frequent exchange of N- 
terminal domains in the evolution of the Avs 


Gao et al., Science 377, eabm4.096 (2022) 


E Avsé (portal) 


2,368 5,182 1,125 2,647 10,915 
9 704.3900.9% 2.0% 0.4% 1.0% 4.1% 
ws Proteobacteria 
- al = a Se = m@ Firmicutes 
| ~ Actinobacteria 
lm Bacteroidetes 
lm PVC group 
Cyanobacteria 
_ Other 
lm Archaea 
Unclassified 


Number of phyla 
8 


—— 
ei 


2 *EcAvs2 
Ec3Avs4 + 


CcAvs4 +, 


ws Bacteria 
ws Archaea 


2000 
1800 
1600 
1400 
1200 
1000 


N-terminal effector 


Amino acids excluding oO 


Taxonomy (inner ring) 

{Mf Proteobacteria { Firmicutes 

- Actinobacteria gj Cyanobacteria 
{™@ Bacteroidetes gj Archaea 


li PVC group 
- Other 
Unclassified 


N-terminal domains (outer ring) 

NTD ATPase sensor f 2 ot 

' | Cap4 

| Mrr 

HNH 

| REase 

' Putative REase 

| MBL + serine protease 
' MBL + serine protease 
: Trypco2 + serine protease 
' Caspase + EAD10 

| Nucleosidase 

' CMP hydrolase 

: TIR 

| SIR2 

| Transmembrane 

' Transmembrane 

| Unknown-1 

+ Unknown-2 

| Unclassified 

| Short 

ok 2 
0123 
log, count 


Proteases Nucleases 


Other 


N-terminal domains (outer ring) 
NTD ATPase sensor ‘ fa 
| | Cap4 
! HTH-Cap4 
' Cap4 
| Mer 
: HNH 
' Putative REase 
| MBL + serine protease 
: Trypco2 + serine protease 
Caspase 
| Caspase + EAD8& 
! Nucleosidase 
' Nucleosidase 
i TIR 
: Metallophosphoesterase 
| Transmembrane 
| Unknown-3 
Unknown-4 
! ! Short 
5 aes age 
0123 
log,, count 


Nucleases 


Proteases 


EcAvs4 
Lt 


Empty vector 


EcAvs4 Q63A/K65A_ EcAvs4 chimera 
o 


-—— 


a 


families emphasizes the modular organization 
characteristic of STAND NTPases (15) and 
implies that closely related ATPase and TPR 
domains can activate a wide range of effector 
functions beyond DNA cleavage. To test this 
hypothesis, we chose an Avs4 homolog from 
Sulfurospirillum sp. that contains ATPase and 
TPR domains that are highly similar to those 
of EcAvs4, with 44% overall amino acid iden- 
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tity, but encompasses an N-terminal region 
with predicted transmembrane helices instead 
of a nuclease (fig. S26A). We generated a chi- 
meric Avs4 protein by transplanting the trans- 
membrane domain to EcAvs4 and found that 
the chimera conferred protection against T7, 
PhiV-1, and ZL19 (Fig. 6F and fig. S26B) while 
retaining the ability to recognize the portal 
proteins from diverse phages (fig. S26C). 
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Phages inhibit Avs defense through diverse 
antidefense proteins 

Bacterial and archaeal viruses have evolved 
diverse mechanisms to counteract defense sys- 
tems (37), including numerous anti-restriction 
and anti-CRISPR proteins (38, 39). We hypoth- 
esized that Avs inhibitors might exist among 
phage early genes, which are expressed be- 
fore the portal and terminase genes during 


the phage life cycle. Focusing on the Auto- 
graphiviridae family of T7-like coliphages, 
which have readily identifiable early genes, as 
well as portals and terminases that strongly 
activate Avs proteins, we identified a set of 
122 representative early genes that typically 
encode small proteins (median length 77 amino 
acids), typical of antidefense genes (data S9). 
We performed a genetic screen for suppressors 


of Avs toxicity by coexpressing these genes with 
SeAvs3, EcAvs4, or KpAvs4 and their cognate 
trigger (Fig. 7A). We identified several early 
genes that rescued cell growth (Fig. 7B), most 
of which originate from a hypervariable re- 
gion within a group of closely related phages 
isolated from wastewater (Fig. 7C) (40, 41). 
To validate these observations, we pro- 
duced three of the Avs inhibitors by cell-free 


Fig. 7. Phage-encoded genes A ie aie B . eis 
inhibit Avs activity. (A) Sche- ____>-p- 
matic of a pooled screen in 
; + — 0.54 e Forsur-7 
E. coli for phage early genes that Bap i 2 ee. 
escue Avs-mediated toxicity. ae i Amp 5 0 causes setcbiiie ciaener sala ainiiaieleibiias a 
Cm®, chloramphenicol resistance D 2 
: > > Z 
gene. (B) Deep-sequencing s D 3 1- EcAvs4 
eadout of antidefense candidate & ; 2 
genes coexpressed with SeAvs3, ? ? g 05-5 
EcAvs4, or KpAvs4. (C) A » @ o pe 
hypervariable early-gene locus Portal or Antidefense candidate “Sosnnnens pesacansespesssesnns pescnesnnsysnsnsccnsjesssasscapensececsayessscsscnjessccsnsayecs 
ay terminase (phage early gene) e 
within a closely related set © KpAvs4 
ZB 1-5 . Pp. 
of Autographiviridae phages i Transform § @ Lidtsur-6 
contains abundant antidefense E. coli <5 @ Smaasur-6 
Lidtsur-17 
genes. The tree was constructed Initial Final coun, © Usur-3 Mellemsur-6 
from a concatenated alignment 8 0 eecssceee: papsnenes agsecteMcere-elftts -epceere-cesqesseee@esgsesnassesqonescesseqessnEsane nee? 
of conserved proteins present in a 10 a - 40 oO ee 0 80 ve 
i= Antidefense candidate 
all 10 phages. Colors represent 3 | l 
groups of proteins clustered Gene e Hits (>0.02) © Other  e Anti-restriction 
at 40% sequence identity 
and 70% coverage. (D) Agarose D 
gel analysis showing in vitro Phage Genome % &-% 
fhe of sie Smaasur DBBODICDODM DOD —_ ra re es 
activity by three antidefense ntidefense - — wv YS 
~ J8-65 
candidates. (E) Schematic ODPODNDOOM 7 SeAvs3 + + + + + 
of the mechanism of Avs Lidtsur - DBDDDRMEDOLODY (HO gp1i9 HHH 
proteins as antiphage pattern- Glasur. D> DDDDPDNDDDDDOED JLDDD kb 
recognition receptors. | = 
> Usur D DDDACODDBCODIL JT 5- Lael 
Forsur —[) [DI DDDCDODDOD) JD 4 
Altidsur = D DDDDDDRONDILOD) LDP 
1 
~- Aldrigsur 1) DD DDRPROMDLDD JAD 
Mellemsur DDPWLDD j~—{_\ 7 0.5— 
— 
0.1 Megetsur [)[>)DDDARONILOD) J 
I 
0.0 6.0 8.7 41.5 
Distance (kb) 
E 


Infection + 
phage replication 


Gao et al., Science 377, eabm4096 (2022) 


Anti-defense expression 


12 August 2022 


Avs inhibition 


Cell death 
Population survival 


Phage propagation 
Population death 


9 of 15 


RESEARCH | RESEARCH ARTICLE 


translation and observed inhibition of SeAvs3 
nuclease activity in vitro by Lidtsur-17 and, toa 
lesser degree, by Forsur-7 (Fig. 7D). Lidtsur-17, 
Forsur-7, and Lidtsur-6 were also active in 
phage plaque assays and restored phage prop- 
agation on Avs-containing E. coli (fig. S27). Sur- 
prisingly, these inhibitors were active against 
different Avs families, including the chimeric 
Avs4, where the effector nuclease of EcAvs4 
was replaced with a transmembrane domain. 
Furthermore, the lack of detectable sequence 
similarity between these inhibitors suggests 
distinct modes of action, which resembles the 
case of the highly diverse anti-CRISPRs (38, 39). 
Further studies will be required to elucidate 
how these phage proteins block Avs activity. 


Discussion 


Here, we characterize four families of prokary- 
otic STAND NTPases and demonstrate that 
they are pattern-recognition receptors against 
two phage proteins: the large terminase sub- 
unit and the portal. These proteins, along with 
the major capsid protein, are the signature pro- 
teins of the virus realm Duplodnaviria, which 
unites tailed phages and tailed archaeal viruses 
with herpesviruses (25). Members of this realm, 
particularly tailed phages, are the most abun- 
dant among known viruses (42, 43). The portal 
protein nucleates virion assembly, occupying 
the distinctive pentameric vertex of capsids and 
providing the attachment site for the phage 
tail, and serves as the channel for genome 
entry into and exit from the capsid (44). The 
terminase is the motor that packages the phage 
genome into the capsid at high density and 
pressure, using the energy of ATP hydrolysis, 
and cleaves DNA concatemers into genome- 
size units (45). The universal, complex molecu- 
lar functions of these proteins engender strong 
selective constraints and hence evolutionary 
conservation. It is therefore logical that these 
particular proteins were selected as the targets 
for pattern recognition during the coevolution 
of prokaryotes with viruses. The three groups 
(Avs1 to Avs3) that recognize terminases 
do not form a clade in the phylogeny of the 
STAND domain (fig. S3), suggesting that de- 
fense based on terminase recognition evolved 
independently on multiple occasions. 

The in vitro reconstitution and structural 
analysis of two Avs systems described here pro- 
vide insight into their mechanism of defense, 
including promiscuous DNA endonuclease 
activity (Fig. '7E). Although many Avs proteins 
contain predicted nucleases, we identified di- 
verse N-terminal effector domains throughout 
Avs families, indicating distinct mechanisms 
of defense that remain to be characterized. The 
demonstration that at least some of these effec- 
tors can be swapped without compromising 
Avs defense function highlights the modular 
functionality of these proteins, which appears 
important for the diversification of defense 
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mechanisms. The effectors of SeAvs3 and 
EcAvsé4 are both activated by tetramerization, 
suggesting that the diverse Avs effectors are 
unified by the requirement for oligomerization 
for activity, acommon mechanism for signal 
transduction by STAND NTPases (46). Indeed, 
oligomerization is also involved in the activation 
of Cap4 nucleases in cyclic oligonucleotide- 
based antiphage signaling systems (26). 

Notably, Avs proteins recognize conserved 
structural features of their cognate targets 
across an extreme variety of amino acid se- 
quences, including those originating from both 
tailed phages and archaeal viruses, as well as 
eukaryotic herpesviruses, which are only dis- 
tantly related and do not infect prokaryotes. 
Structural analysis of Avs3 revealed that it di- 
rectly detects the active-site residues and ATP 
ligand of the terminase, thereby targeting the 
moieties that are the most difficult for phages 
to mutate without abrogating function. 

Notable similarities, but also several differ- 
ences, exist between eukaryotic NLRs and pro- 
karyotic Avs proteins. Both are intracellular 
receptors of the STAND superfamily that 
detect PAMPs through C-terminal repetitive 
structures. Both exhibit triggered oligomer- 
ization, but with distinct interfaces between 
the central ATPase domains (fig. S18). Simi- 
lar to plant NLRs like RPP1 and the Roq1 
resistosome, both Avs3 and Avs4 form tet- 
ramers, with the effector domains activated by 
forming a twofold symmetric dimer of dimers 
(29, 47). In the absence of their ligands, animal 
and plant NLRs have autoinhibited states that 
prevent oligomerization and effector activation 
(19). Whether such states exist for Avs proteins 
will require further investigation. In contrast to 
Avs proteins, NLRs often contain N-terminal 
adaptor domains that recruit other proteins 
to form macromolecular complexes known as 
inflammasomes or resistosomes (79). In these 
complexes, the effector (e.g., caspase-1) is a 
separate protein rather than a domain of the 
NLR. By contrast, Avs effectors are usually the 
N-terminal domain of the STAND NTPase. 
This simpler organization might be advanta- 
geous because counteracting phage replica- 
tion requires a rapid, direct cellular response. 
This contrast parallels the distinction between 
the mechanisms of prokaryotic and eukary- 
otic STING proteins, whereby bacterial STING 
homologs directly activate TIR domain effectors 
rather than regulate transcription, as mamma- 
lian STINGs do (9, 48). 

Bacteria and archaea encode numerous di- 
verse STAND NTPases beyond the four families 
characterized in this study (15). Although some 
are not involved in defense, such as the tran- 
scriptional regulators MalT (22), AfsR (23), and 
GutR (24), several are confirmed defense genes 
or are predicted to have a defense function 
based on their enrichment in genome regions 
adjacent to known defense systems (4, 49). We 
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investigated several of these other defense- 
associated systems (data S2) but observed no 
detectable toxicity when coexpressed with any 
of the 48 tested terminases or portals, suggest- 
ing that they are triggered by other pathogen- 
related patterns that remain to be identified. 
Further investigation will reveal whether these 
triggers are proteins and whether they are 
phage-specific or endogenous to the host. For 
instance, most characterized plant and fungal 
NLRs sense the state of host pathways rather 
than pathogen-specific proteins (19, 20), and it 
remains a possibility that other groups of pro- 
karyotic STAND NTPases function similarly. 

Given the extensive sequence divergence 
among the STAND NTPases, it is unclear 
whether Avs proteins are direct evolutionary 
ancestors of eukaryotic NLRs, although this 
remains a possibility. Alternatively, or addi- 
tionally, the characteristic tripartite domain 
architectures of diverse STAND NTPases could 
have evolved convergently, suggesting that this 
modular organization is a facile way to create 
allosterically activated enzymes that respond 
to PAMPs and could inspire the design of 
engineered molecular sensors. Overall, the 
results of this work advance our understanding 
of host-virus interactions in diverse microbes 
and extend the paradigm of pattern recogni- 
tion of pathogen-specific proteins to all three 
domains of life. 


Materials and methods 
Phylogenetic analysis of STAND NTPases 


For STAND phylogenetic analysis (fig. $3), 
PSI-BLAST searches (50) against the database 
of complete bacterial and archaeal genomes 
(extracted from GenBank, March 2019) were 
performed for three iterations using ATPase 
domains of seven Avs1 to Avs3 homologs (WP_ 
126523998.1, WP_115407481.1, WP_084.007836.1, 
WP_060615938.1, WP_139964370.1, WP_ 
063118745.1, and WP_001017806.1) investigated 
experimentally. The 2000 best hits from each 
run were taken and combined with 949 Avs4 
homologs found in the NCBI nonredundant 
protein sequence database (nr) in 2021. A non- 
redundant set of 4843 sequences was used for 
phylogenetic reconstruction using a hybrid 
UPGMA/FastTree approach as follows. At the 
first step, sequence clusters were obtained 
using MMseqs2 (57) with a sequence similarity 
threshold of 0.5, and the sequences within each 
cluster were aligned using MUSCLE (52). At the 
second step, cluster-to-cluster similarity scores 
were obtained using HHSEARCH (53) (in- 
cluding trivial clusters consisting of a single 
sequence each) and normalized by the mini- 
mum of the self-scores. Relative similarity 
scores (Ss) were converted to distances (d) de- 
fined as d = -In s, and a UPGMA (unweighted 
pair group method with arithmetic mean) 
dendrogram was constructed from the distance 
matrix (54). At the third step, sequence-based 
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trees were constructed from the cluster align- 
ments using FastTree (55) (WAG evolution- 
ary model, gamma-distributed site rates) and 
rooted by midpoint; these trees were grafted 
onto the tips of the profile similarity-based 
UPGMA dendrograms. FastTree was also used 
to calculate support values. Only the second 
step of the above procedure was applied to 
reconstruct the UPGMA dendrogram using 
multiple alignments of selected well-supported 
branches identified by the first procedure. 


Construction of avs phylogenetic trees 


Homologs of each of the four clades of avs 
genes were identified using PSI-BLAST searches 
against the NCBI nr database in June 2021 
using position-specific scoring matrices for 
each clade derived from manually curated mul- 
tiple sequence alignments (MSAs) of conserved 
regions. After a round of curation to remove 
false positives hits and partial proteins, refer- 
encing the corresponding genome assemblies 
to correct misannotated start codons, a list 
of 1584, 2342, 1018, and 1813 nonredundant 
full-length proteins were obtained for Avs1, 
Avs2, Avs3, and Avs4, respectively. To reduce 
sampling bias, sequences were then clustered 
at 95% sequence identity (minimum 80% 
coverage) using MMseqs2 with parameters- 
min-seq-id 0.95 -c 0.8-cov-mode 1. One repre- 
sentative from each cluster was selected for 
subsequent analyses, resulting in 843, 1255, 
630, and 1089 sequences for Avs1, Avs2, Avs3, 
and Avs4, respectively. 

MSAs of each Avs clade, excluding the varia- 
ble N-terminal domains, were generated using 
MAFFT v7.450 (56) with global pairwise align- 
ment (parameters-maxiterate 1000-globalpair). 
Alignments were trimmed using trimAl 1.2 with 
a gap threshold of 0.25 (-gt 0.25). Phylogenetic 
trees were built from the trimmed MSAs using 
1Q-TREE 1.6.12 (57) with the LG+G4 model 
and 2000 ultrafast bootstrap replicates (pa- 
rameters -nstop 500 -bb 2000 -m LG+G4). 
To categorize the N-terminal domains, the 
N-terminal sequences were clustered using 
MMsegs2 with parameters-min-seq-id 0.4 -c 
0.8, and a representative sequence from each 
cluster was analyzed using HHpred (58). Phyla 
classification was determined from the NCBI 
taxonomy database, and trees were rooted by 
midpoint and visualized using iTOL (59). 

The phylogenetic tree comparing the ATPase 
domains of NLR-like genes across model or- 
ganisms (Fig. 1A) was constructed in a similar 
manner, incorporating the set of 23 human 
NLRs and the best characterized NLRs from 
Arabidopsis thaliana (60) and Neurospora 
crassa (20, 61). 


Taxonomic distribution of avs genes 


To determine the taxonomic distribution of 
avs genes, genome assemblies containing 
one or more full-length Avs homologs were 
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identified through the NCBI Identical Protein 
Groups (IPG) database (data S8). Redundant 
assemblies were removed on the basis of their 
nine-digit accession numbers. To determine 
the percentage of genomes containing avs 
genes, the list of all available prokaryotic 
assemblies was downloaded from https:// 
ftp.ncbi.nlm.nih.gov/genomes/GENOME _ 
REPORTS/prokaryotes.txt. 


Construction of terminase and portal alignments 


Structures of all tested terminase and portal 
proteins were predicted using AlphaFold2 
(62), and structures were aligned and visual- 
ized using PyMOL 2.3.4. Representatives of 
the predicted structures were used as input 
for MSA construction using PROMALS3D (63). 
Before computing pairwise sequence identi- 
ties, MSAs were trimmed to retain only the 
regions corresponding to the core terminase 
or portal fold. 


Cloning 


Genes were chemically synthesized or ampli- 
fied with Q5 [New England Biolabs (NEB)] or 
Phusion Flash (Thermo Scientific) polymerase. 
Plasmids were assembled using Gibson as- 
sembly. Plasmid sequences were verified by 
Tn5 tagmentation and high-throughput se- 
quencing, as previously described (4, 64). 


Bacterial strains 


FE. coli NovaBlue and NovaBlue(DE3) were 
obtained from Millipore Sigma. FE. coli K-12 
(ATCC 25404) and strain C (ATCC 13706) were 
obtained from the American Type Culture 
Collection. All genetic assays were performed 
with E. coli NovaBlue(DE3) unless indicated 
otherwise. 


Competent cell production 


FE. coli strains were cultured in ZymoBroth 
with 25 ug/ml chloramphenicol and made 
competent using Mix & Go buffers (Zymo) 
according to the manufacturer’s recommended 
protocol. 


PhiV-1 fragment screen 


DNA fragments consisting of intact ORFs were 
amplified from phage PhiV-1 and cloned into 
expression plasmids after a LacI-repressed 
T7 promoter. Plasmids were pooled with an 
mNeonGreen-expressing control plasmid and 
transformed into E. coli NovaBlue(DE3) con- 
taining either SeAvs3, EcAvs4, or a pACYC184 
empty vector. An additional sample consisting 
of the plasmid pool transformed into empty 
vector-containing E. coli NovaBlue, which does 
not express from T7 promoters, was also in- 
cluded to assess the basal toxicity of the phage 
genes. 

After 1 hour of outgrowth in SOC (super 
optimal broth with catabolite repression) 
medium at 37°C, cells were plated on LB agar 
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plates containing 25 ug/ml chloramphenicol 
and 100 ug/ml ampicillin in the absence of 
isopropyl-B-D-thiogalactopyranoside (IPTG). 
Plates were incubated for an additional 12 hours 
at 37°C, after which surviving plasmids were 
isolated by miniprep (Qiagen). A total of 200 ng 
of plasmid for each condition was tagmented 
with Tn5d to yield an average fragment size of 
~500 base pairs. After addition of 0.5 volumes 
of 0.1% SDS and column purification, tagmented 
fragments were amplified over eight cycles by 
Q5 DNA polymerase (NEB) with sample-specific 
i5 and i7 index primers. Amplicons were gel ex- 
tracted and sequenced on a NextSeq (Illumina) 
using 150 cycles for the forward read. Reads 
were mapped to reference sequences using 
Geneious Prime. The read coverage of each 
sample was then normalized to the read cover- 
age of the mNeonGreen control within the same 
sample. Finally, for each sample, the read cov- 
erage per base was divided by the correspond- 
ing read coverage per base for the empty vector 
NovaBlue(DE3) control (Fig. 1D) or by that of 
the empty-vector NovaBlue control (fig. S4). 


Terminase and portal depletion screens 


Terminase and portal genes were amplified 
directly from phage samples or chemically 
synthesized (Twist Bioscience) with codon op- 
timization for E. coli. Genes were expressed 
under the control of a pBAD promoter. Plas- 
mids were pooled with an mCherry-expressing 
control plasmid and transformed into E. coli 
NovaBlue(DE3) containing an Avs system 
or a pACYC184 empty vector. After 1 hour of 
outgrowth in SOC at 37°C, cells were plated on 
LB agar plates containing 25 ug/ml chloram- 
phenicol and 100 pg/ml ampicillin with 0.002% 
arabinose or, in some cases, with 0.2% arabi- 
nose, as detailed in the figures. After an addi- 
tional 12 hours at 37°C, plasmids were isolated 
and sequenced, and depletion values (Fig. 2, B 
and D; and figs. S7 and S26C) were computed 
as described for the PhiV-1 depletion screen. 


Portal and terminase mutant screens 


Two synonymous versions of the T7 portal 
gene were randomly mutagenized by PCR 
using KAPA HiFi HotStart ReadyMix DNA 
polymerase (Roche) and cloned by Gibson 
assembly into a plasmid backbone containing 
a Lacl-repressed T7 promoter. Plasmids were 
column purified and transformed into E. coli 
NovaBlue(DE3) containing EcAvs4, KpAvs4, 
or CcAvs4. Cells were plated on LB agar plates 
containing 25 ug/ml chloramphenicol and 
100 ug/ml ampicillin in the absence of IPTG. 
After overnight growth, surviving colonies were 
sampled at random, cultured, and sequenced, 
and those containing single amino acid substi- 
tutions in the portal were retained for subse- 
quent analysis. To reduce the number of stop 
codons and frameshift mutants sampled, a fluo- 
rescent protein (mNeonGreen) was included in 
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the plasmid backbone immediately after the 
portal ORF with a single-nucleotide overlap, 
such that mNeonGreen was translated only if 
the portal ORF remained intact (65) (fig. $23, 
Aand B). Both portal and mNeonGreen were 
translated as separate polypeptides. 

To quantitatively assess the effect of each 
mutant on Avs4-mediated toxicity (fig. S23C), 
mutant plasmids were pooled and retrans- 
formed into Avs4-containing E. coli as de- 
scribed for the PhiV-1 depletion screen. Fold 
depletion was also quantified as described, 
with the exception that only reads containing 
20-mer sequences specific to one mutant (i.e., 
mapping to the mutation site) were counted in 
the analysis. A similar procedure was followed 
to quantify the effect of truncation of the 
terminase or portal (figs. S22 and S23E), as 
well as terminase-domain mutations (Fig. 5F). 


Antidefense screen 


Putative early genes from Autographiviridae 
coliphages were tabulated and clustered at 
50% sequence identity and 50% coverage using 
MMsegs2 (-min-seq-id 0.5 -c 0.5), resulting in 
120 clusters. One representative was selected 
from each cluster, along with two additional 
sequences, for a total of 122 initial candidates 
(data S9). Genes were synthesized by Twist 
Bioscience and cloned by Gibson assembly into 
expression vectors containing either the portal 
or terminase from phage PhiV-1 driven by a 
pBAD promoter. Antidefense candidates were 
expressed under the control of a Jac promoter. 
Plasmids were pooled and transformed into 
FE. coli containing SeAvs3, EcAvs4, KpAvs4, 
or an empty vector. Cells were grown at 37°C 
for 16 hours on LB agar plates containing 
25 ug/ml chloramphenicol and 100 ug/ml 
ampicillin with no added arabinose. After 
plasmid isolation, antidefense candidates 
were amplified over two rounds of PCR to 
attach eight-nucleotide i7 and i5 index bar- 
codes and sequenced with a 600-cycle MiSeq 
kit to ensure maximal coverage of each ORF. 
Reads containing mutations were discarded 
in the subsequent analysis. 


Phage plaque assays 


E. coli host strains were grown to saturation at 
37°C in LB or Terrific Broth. To 10 ml of top 
agar (10 g/liter tryptone, 5 g/liter yeast extract, 
10 g/liter NaCl, 7 g/liter agar), chloramphenicol 
was added to a final concentration 25 ug/ml 
and, if needed, ampicillin to a final concentra- 
tion of 100 ug/ml. Ten-fold dilutions of phage 
in phosphate-buffered saline were spotted on 
the plates. After overnight incubation at 37°C, 
plates were photographed in a dark room with 
a white backlight. 


Construction of mutant phages 
PhiV-1 gp8 and gp19 knockout phages were 
constructed as previously described (66, 67) 
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using plasmid donors with homology arms to 
gp8 or gpl9 inatrxvA E. coli strain [JW5856 
from the Keio collection (68)]. The trxA gene 
was inserted via the donor into the PhiV-1 
genome as a Selection marker. 


Protein purification 


Avs genes were cloned into pCDF-Duet expres- 
sion plasmids containing a C-terminal 6xHis 
tag. PhiV-1 gp8 and gp19 genes were cloned 
into TwinStrep-SUMO expression plasmids. 
Proteins were expressed in EF. coli BL21(DE3) 
cells (NEB no. C2527H). Cells were grown in 
Terrific Broth to mid-log phase, and the tem- 
perature was lowered to 18°C. Expression was 
induced at ODgoo 0.6 with 0.25 mM IPTG for 
16 to 20 hours before harvesting and freezing 
cells at -80°C. Cell paste was resuspended in 
lysis buffer (50 mM Tris-HCl pH 7.5, 500 mM 
NaCl, 5% glycerol) supplemented with EDTA- 
free cOmplete protease inhibitor (Roche). 
Cells were lysed using a LM20 microfluidizer 
device (Microfluidics), and cleared lysate was 
bound to either Strep-Tactin Superflow Plus 
(Qiagen) or Ni-NTA Superflow resin (Qiagen). 
For TwinStrep-SUMO phage proteins, the resin 
was washed with lysis buffer, and proteins 
were eluted with lysis buffer supplemented 
with 5 mM desthiobiotin. The TwinStrep- 
SUMO tag was removed by overnight digest 
at 4°C with homemade SUMO protease Ulp1 
at a 1:100 weight ratio of protease to target. 
Cleaved proteins were run on a Superose 6 
Increase column (GE Healthcare Life Sciences) 
with a final storage buffer of 25 mM Tris-HCl 
PH 7.5, 500 mM NaCl, 10% glycerol, and 1 mM 
dithiothreitol (DTT). 

Avs proteins containing 6xHis tags were 
bound to Ni-NTA resin in the presence of 
25 mM imidazole, washed with lysis buffer 
containing 50 mM imidazole, and eluted with 
lysis buffer containing 300 mM imidazole. 
SeAvs3 was diluted to a final concentration 
of 100 mM NaCl and purified using a Resource 
Q column on an AKTA pure 25 L (GE Health- 
care Life Sciences) with a 100 mM-1M NaCl 
gradient. EcAvs4 was further purified by di- 
luting to a final concentration of 100 mM NaCl 
and absorbing contaminants by flowing the 
protein over a Resource Q and Heparin HP 
column. SeAvs3 and EcAvs4 were concentrated 
and loaded onto a Superose 6 Increase column 
with a final storage buffer of 25 mM Tris-HCl 
PH 7.5, 500 mM NaCl, 10% glycerol, and 1 mM 
DTT. SeAvs3 for cryo-EM analysis was purified 
in the same buffer without glycerol and only 
300 mM NaCl, then concentrated to 1.4 mg/ml 
in a 500-ul 100,000 molecular weight cutoff 
(MWCO) Amicon spin concentrator. 


Avs complex purification 


Avs-TwinStrep constructs were cotransformed 
with plasmids expressing either gp8 or gp19 
into electrocompetent EF. coli BL21(DE3) (Sigma 
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Aldrich CMC0016) and grown and induced 
as before. Avs pulldowns using Strep-Tactin 
Superflow Plus resin were run on SDS-PAGE 
(SDS-polyacrylamide gel electrophoresis) gels, 
and gp8 and gp19 bands were excised and 
confirmed by mass spectrometry (Taplin Bio- 
logical Mass Spectrometry Facility, Harvard 
Medical School). For tandem affinity purifica- 
tion, plasmids containing SeAvs3-6xHis and 
gp19-StrepTag were cotransformed into elec- 
trocompetent EF. coli BL21(DE3) and grown and 
induced as before. An SeAvs3-gp19 complex was 
purified using Ni-NTA followed by Strep-Tactin 
SuperFlow Plus resin. The final elution was 
run on a Superose 6 Increase column and 
yielded a peak elution at 13 ml containing a 
1:1 ratio of SeAvs3 and gp19, as determined 
by SDS-PAGE band intensity analysis. A stan- 
dard curve was generated using the Bio-Rad 
Gel Filtration Standard (1511901), and the gel- 
phase distribution coefficient (K,,.) was cal- 
culated as (elution volume - void column)/ 
(column volume - void volume). 


SeAvs3-terminase complex formation 
for cryo-EM 


A total of 20 ug of SeAvs3 was mixed with 8.3 ug 
of PhiV-1 gp19 terminase in a total volume of 
24 ul in the presence of 17 mM Tris-HCl pH 
7.5, 280 mM NaCl, 0.8 mM DTT, 2% glycerol, 
5mM MgCly, and 1 mM ATP. The reaction 
was incubated at 37°C for 30 min and placed 
on ice for ~1 hour before cryo-EM grid prepa- 
ration. Cryo-EM grids were prepared on a 
Thermo Scientific Vitrobot Mark IV at 4°C 
and 100% humidity. A total of 3 ul of reaction 
was applied to a freshly glow-discharged (12 s 
at 15 mA) Cu 300 R2/2 holey carbon grid with a 
2-nm layer of amorphous carbon (Quantifoil). 
After 30 s, the grid was manually blotted with 
Whatman Grade 1 filter paper and plunged 
into liquid ethane. 


EcAvs4-portal complex formation for cryo-EM 


PhiV-1 gp8 was cloned into an MBP-bdSUMO 
expression plasmid, and EcAvs4 was cloned 
into a pCDF-Duet plasmid with an internal 
TwinStrep tag added between residues 114 
and 115. The EcAvs4 Mrr-like nuclease active 
site was mutated (Q63A/K65A) to allow co- 
expression with the portal. These two plasmids 
were cotransformed into E. coli BL21(DE3). 
A total of 6 liters of culture was grown in 
Terrific Broth to mid-log phase, and the tem- 
perature was lowered to 18°C. IPTG (0.25 mM) 
was added to induce expression, and growth 
was continued overnight. Cell paste was re- 
suspended in lysis buffer (50 mM Tris-HCl 
PH 7.4, 250 mM NaCl, 5% glycerol, 5 mM 
6-mercaptoethanol, 2 mM MgCly, 0.1 mM ATP) 
supplemented with EDTA-free cOmplete pro- 
tease inhibitor (Roche). Cells were lysed using 
a LM20 microfluidizer (Microfluidics) and 
cleared lysate was bound to Amylose Resin 
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High Flow (NEB). After extensive washing 
with lysis buffer, the resin was eluted overnight 
at 4°C by addition of 10 ug of homemade 
bdSENPI protease. Eluted protein was in- 
cubated with Strep-Tactin Superflow Plus 
resin, washed with lysis buffer, then eluted 
with lysis buffer supplemented with 5 mM 
desthiobiotin. The eluate was concentrated 
in a 6-ml Vivaspin spin concentrator (30,000 
MWCO) and run on a Superose 6 Increase 
column using 20 mM Tris-HCl pH 7.4, 200 mM 
NaCl, 2mM MgClo, and 0.1 mM ATP. Peak 
fractions containing EcAvs4 and gp8 were 
concentrated to 1.7 mg/ml using a 0.5 ml 
Amicon spin concentrator (100,000 MWCO) 
then immediately used for cryo-EM grid prep- 
aration. Cryo-EM grids were prepared on an 
Thermo Scientific Vitrobot Mark IV at 4°C and 
100% humidity. A total of 3 ul of sample was 
applied to a freshly glow-discharged (60 s at 
15 mA) Cu 300 R1.2/1.3 holey carbon grid 
(Quantifoil). The grid was blotted for 4 s with 
blot force +5 and drain time 1 s, then plunged 
into liquid ethane. 


Cryo-EM data collection 


All data were collected using the Thermo Sci- 
entific Titan Krios G3i cryo TEM at MIT.nano 
using a K3 direct detector (Gatan) operated in 
super-resolution mode with twofold binning 
and an energy filter with slit width of 20 eV. 

For SeAvs3-gp19, 15,422 movies were collected 
at 105,000x magnification giving a real pixel 
size of 0.8697 A, with defocus ranging from 
1 to 3.5 um with an exposure time of 1.15 s, 
fractionated into 30 frames and a flux of 
19.7 e /pix/s giving a total fluence per micro- 
graph of 30 e /A”. For EcAvs4-gp8, 22,902 movies 
were collected at 130,000x magnification giving 
areal pixel size of 0.6788 A, with defocus ranging 
from 1 um to 2.5 um with an exposure time of 
0.6 s, fractionated into 24 frames and a flux of 
23.6 e /pix/s giving a total fluence per micro- 
graph of 30.8 e /A”. 


Cryo-EM data processing 


All cryo-EM data were processed using 
RELION-4.0 (69). Movies were corrected for 
motion using the RELION implementation of 
MotionCor2, with 5-by-5 patches and dose- 
weighting. Contrast transfer function (CTF) 
parameters were estimated using CTFFIND-4.1. 
All reported resolutions use the gold-standard 
Fourier shell correlation with a cutoff of 0.143. 

For the SeAvs3-gp19 dataset, particle pick- 
ing was first carried out on 800 micrographs 
using the Topaz general model (70). A good 
subset of these particles, as determined by 
three-dimensional (3D) classification, was 
used to train Topaz, and this trained model 
was used to pick 128,500 particles from the 
entire dataset. Extracted particles, downscaled 
fourfold, were subjected to 3D classification 
without imposing symmetry using a reference 
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derived from a preliminary dataset. A total of 
44,489 particles, corresponding to 34.5% of 
picked particles, showed sharp features and 
apparent C4 symmetry and were reextracted 
without binning and refined with C4 symmetry 
imposed. After refining per-particle defocus 
and global magnification, beamtilt, trefoil, and 
performing Bayesian polishing, a reconstruc- 
tion was yielded at 3.8-A resolution with clear 
density for the SeAvs3 ATPase domain but 
blurred density for both the N-terminal nu- 
clease and C-terminal TPR+terminase domains. 

To improve density for the N-terminal nu- 
clease domains, 3D classification without 
alignment was performed while imposing C2 
symmetry. This revealed two equal popula- 
tions of particles each with clear density for 
the nuclease domains, related by a 90° rota- 
tion in the g axis. In the refinement STAR file, 
the parameter 7/nAngleRot was therefore in- 
cremented by 90° for one of these populations, 
before focused refinement starting at 1.8° local 
angular searches with a soft mask around 
the nuclease domains. This produced a re- 
construction at 3.4 A, measured using the 
same soft mask. 

To improve density for the C-terminal 
TPR+terminase domains, C4 symmetry ex- 
pansion was performed on the C4 refinement 
data.star file, followed by particle subtraction 
with recentering using a mask around one 
of the four asymmetric units. This generated 
four subparticles for each original particle. 
Refinement starting at 1.8° local angular 
searches with a soft mask, followed by CTF 
refinement and another round of refinement, 
produced a reconstruction at 3.4-A resolution. 

For the EcAvs4-gp8 dataset, 1825 particles 
from 80 micrographs were manually picked 
and used to train Topaz. The trained Topaz 
model then picked 444,626 particles from 
the entire dataset, which were extracted with 
fourfold binning and subjected with 3D classi- 
fication without imposing symmetry using the 
octameric (pseudo-D4) EcAvs4-gp8 reference 
derived from a preliminary dataset. A total of 
133,133 particles (29.9%) showing the same 
pseudo-D4 symmetry were reextracted at 
1.034 A/pix and refined with D4 symmetry 
imposed. After Bayesian polishing, this yielded 
a 3.7-A resolution reconstruction. D1 symmetry 
expansion followed by particle subtraction 
was then used to convert these particles to 
266,266 subparticles that correspond to the 
tetrameric complex. Like SeAvs3-gp19, these 
also had blurry density for the N-terminal 
nuclease and C-terminal TPR+terminase do- 
mains but additionally had poor density for 
the ATPase domains, suggesting a C2 recon- 
struction might be suitable for the whole 
tetramer. 

To improve overall density, a 3D classifi- 
cation without alignment was first performed 
with C2 symmetry imposed. This produced 
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two equally occupied classes, collectively rep- 
resenting a 169,977-particle subset (63.8%), 
that appeared identical but for a 90° rotation, 
but less clearly distinguished than the same 
analysis on SeAvs3-gp19. Therefore, they were 
refined together with local 1.8° angular searches 
and C2 symmetry but with “Relax symmetry: 
C4” to account for the pseudo-C4 symmetry. 
This produced a consensus C2 refinement but 
still with relatively blurred densities for the 
nuclease and C-terminal TPR domains. The 
nuclease domain density was improved by 
focused refinement with a soft mask, followed 
by refining anisotropic magnification, per- 
(sub)particle defocus, and beamtilt, trefoil 
and fourth-order aberrations, and second 
refinement, yielding a 2.9-A resolution re- 
construction. The C-terminal TPR domains 
were improved by C4 symmetry expansion, 
followed by C1 focused refinement with a 
soft mask and CTF refinement, but still had 
unclear density at the periphery at the site of 
an important EcAvs4-portal contact. There- 
fore, a final 3D classification was performed 
with a soft mask just around this contact and 
a regularization parameter (T) of 20. A total 
of 500,066 selected subparticles (73%) were 
then focus-refined with the same mask to yield 
a reconstruction at 3.0-A resolution with better 
density for this region. 


Model building 


Initial models for SeAvs3, EcAvs4, PhiV-1 
gp8, and PhiV-1 gp19 were generated using 
AlphaFold and fit into the cryo-EM maps using 
ISOLDE (77) with adaptive distance restraints, 
followed by manual rebuilding in Coot (72) 
and further refinement in ISOLDE. Coordi- 
nates were refined in real space using PHENIX 
(73), performing one macrocycle of global 
minimization and atomic displacement pa- 
rameter (ADP) refinement and skipping local 
grid searches. 


In vitro cleavage reactions 


Purified Avs proteins were incubated with nu- 
cleic acid substrates in reaction buffer (20 mM 
HEPES pH 7.5, 100 mM NaCl, 1 mM DTT, 
5% glycerol). Typical reactions contained ~100 ng 
of DNA substrate, 100 ng Avs protein, and 100 ng 
gp8 or gp19 in a 10-ul reaction volume. MgCl, 
was added at 5 mM where indicated, and ATP 
and AMP-PNP at 1 mM. Reactions were carried 
out at 37°C for the indicated time, and products 
were purified using a QIAquick PCR Purifi- 
cation column (Qiagen) before agarose gel 
analysis with a 1% E-Gel EX (Thermo Fisher 
Scientific). 


Bacterial two-hybrid assays 


Expression plasmids were cloned by fusing 
either the T18 or T25 fragments of CyaA from 
Bordetella pertussis to nuclease-deficient Avs 
proteins, as well as the PhiV-1 gp8 portal and 
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gp19 terminase. FE. coli BTH101 cells (F' , 
cya-99, araD139, galE15, galK16, rpsL1 (Str’), 
hsdR2, mcrA1, mcrB1) were cotransformed 
with pairs of T18 and T25 containing plas- 
mids. Overnight cultures were diluted 1:20 and 
plated on indicator plates containing 50 mg/ml 
ampicillin, 25 mg/ml kanamycin, 500 ug/ml 
ammonium iron(II) citrate, 300 ug/ml S-gal, 
and 0.5 mM IPTG. Cells were grown at 30°C 
overnight before imaging. 


Southern blot analysis 


E. coli K-12 (ATCC 254.04) cultures were grown 
to mid-log phase (ODgo9 0.5), and for each 
sample, 6 ml of culture was infected with wild- 
type or mutant PhiV-1 at a multiplicity of 
infection of 1. After 20 min at 37°C, before 
cell lysis, infected cells were pelleted and 
resuspended in 200 ul of media. After further 
incubation at 37°C, for a total of 90 min, 
samples were frozen in liquid nitrogen. DNA 
was extracted from 200-1 cultures by adding 
200 ul of lysis buffer (10 mM Tris-HCl pH 8.0, 
1mM EDTA, 100 mM NaCl, 1% SDS, 2% Triton 
X-100), 100 ul of glass beads, and 200 ul of 
phenol-chloroform (1:1) followed by brief vor- 
texing. Samples were centrifuged at 4°C, and 
DNA from the upper layer was extracted with 
chloroform and precipitated with the addition 
of 1 ml of ice-cold 100% ethanol and centrifu- 
gation at 4°C. DNA pellets were resuspended 
in 200 ul of Tris-EDTA (TE) with 300 ug RNAse 
A (Sigma-Aldrich) and incubated at 37°C for 
1 hour. DNA was once again precipitated with 
the addition of 1 ml of ice-cold 100% ethanol 
and 20 ul of 4M ammonium acetate, centri- 
fuged, dried, and resuspended in TE. 

DNA was digested with Eco47 III and run on 
a1% agarose gel in 1x Tris-borate-EDTA (TBE) at 
100 V. The gel was denatured with 0.5 M NaOH 
and 1.5 M NaCl for 30 min and neutralized 
with 1.5 M NaCl and 0.5 M Tris-HCl pH 7.5 for 
30 min. DNA was transferred to a Hybond N+ 
membrane (GE Healthcare Life Sciences) using 
overnight capillary flow and 10X SSC buffer 
(1.5 M NaCl, 150 mM sodium citrate, pH 7). 
Membranes were UV-crosslinked (Stratalinker 
1800, Agilent) and blocked at 61°C with Church 
hybridization buffer (250 mM NaH,PO, pH 7.2, 
1mM EDTA, 7% SDS). Radiolabeled probes 
complementary to the gp13 gene were gen- 
erated from purified PCR products using the 
Prime-It Random labeling kit (Agilent) and 
[o-32P]-dCTP. Membranes were probed over- 
night, washed three times with 61°C Church 
hybridization buffer, and exposed overnight 
with x-ray film (GE Healthcare Life Science) 
before developing. Quantification of phage 
DNA bands was performed in Fiji with back- 
ground signal subtracted. 
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translation of stored mRNAs 
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INTRODUCTION: In mammals, spermiogenesis 
(postmeiotic male germ cell differentiation) is 
a highly orchestrated developmental process 
controlled by a group of genes collectively 
referred to as spermiogenic genes. Because 
nuclear condensation during spermiogenesis 
gradually halts transcription, spermiogenic 
genes are transcribed in advance during the 
earlier stages of male germ development and 
stored as translationally inert messenger ribo- 
nucleoproteins (mRNPs) in developing sper- 
matids until they are needed for translation. 
Such inert mRNPs are usually organized into 
mRNP granules called germ granules, which 
serve as storage facilities for nontranslating 
mRNAs in various types of germ cells. How- 
ever, little is known about how those mRNAs 
stored in inert mRNPs are activated during 
late spermiogenesis. 


RATIONALE: To understand how translationally 
inert mRNAs are activated during spermio- 
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genesis, we screened potential translational 
regulators by proteomic analysis of polysomes 
from mouse testes. FXR1, a member of the 
fragile X-related (FXR) protein family, stood 
out from the screen as a translational regu- 
lator in late spermatids. By performing eCLIP 
and polysome profiling, in combination with 
generating a germline-specific Fxr1 knock- 
out (Fxri°*°) mouse model, we investigated 
whether FXR1 is required for translation ac- 
tivation in late spermatids. To decipher the 
mechanism underlying FXR1-mediated trans- 
lation regulation, we identified the potential 
cofactor(s) of FXR1 in mouse testes using im- 
munoprecipitation coupled with mass spec- 
trometry. We observed the formation of FXR1 
granules through liquid-liquid phase separation 
(LLPS), which recruits translation factors in late 
spermatids, and used the TRICK (translating 
RNA imaging by coat protein knock-off) re- 
porter system to determine whether FXR1 LLPS 
is required for target translation in cultured 


FXR1-containing granules mediate translation activation in late spermatids. During late spermiogenesis, 
elevated FXR1 undergoes LLPS to assemble target mRNAs as FXR1 mRNP granules that recruit translational 
machinery by interacting with the eukaryotic translation initiation factor 4 gamma 3 (EIF4G3) to activate 

the stored mRNAs in late spermatids. These phase-separated FXR1 granules drive a large translation program to 
instruct spermatid development and sperm production in mice. 
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cells. To further investigate whether FXR1 
LLPS is critical for target translation in mouse 
spermatids, we ectopically expressed wild-type 
FXRI1, LLPS-deficient FXR1”**!” mutants, or 
LLPS-restored FXR1°*!”-IDRFS mutants in 
Feri testes using lentiviral testis transduc- 
tion. Finally, by generating germline-specific 
Ferl"” knock-in mice, we determined wheth- 
er FXR1 LLPS is indispensable to translation 
activation in late spermatids, spermiogenesis, 
and male fertility in mice. 


RESULTS: We found that FXR1 was much more 
enriched in polysomes from 35-day postpartum 
(dpp) testes relative to 25-dpp testes, suggest- 
ing arole for FXR1 in translation activation in 
late spermatids. We identified a group of 770 
mRNAs as being likely direct FXR1-activated 
targets, and demonstrated that germline- 
specific Fxrl deletion in mice markedly reduced 
target translation in late spermatids. Consist- 
ent with FXR1 functioning in translation acti- 
vation in late spermatids, FrrI“*° male mice 
were infertile and displayed spermatogenic 
failure at late spermiogenesis. Interestingly, 
we observed a pronounced up-regulation of 
FXR1 and the formation of abundant, distinct 
condensates in late spermatids, suggesting 
concentration-dependent LLPS. Mechanistic 
studies revealed that FXR1 undergoes LLPS to 
form condensates that assemble target mRNAs 
as mRNP granules and then recruit transla- 
tional machinery to activate the stored mRNAs. 
Consistently, ectopic expression of wild-type 
FXRI or FXR1*"?-IDR®US, but not FXR1"”, 
activated target translation in cultured cells 
and successfully rescued target translation 
in late spermatids and spermiogenesis in 
Feri mice. Furthermore, Frrl™**? knock- 
in mutant mice highly phenocopy Fur1°*° 
mice, directly supporting the indispensability 
of FXR1 LLPS to target translation in late 
spermatids, spermiogenesis, and male fertil- 
ity in mice. 


CONCLUSION: Our findings demonstrate that 
FXR1 is an essential translation activator 
that instructs spermiogenesis in mice and 
unveil a key contribution of FXR1 LLPS to 
the translation activation of stored mRNAs 
in mouse spermatid and male fertility in 
mice. In addition, our study pinpoints the 
importance of LLPS in a developmental pro- 
cess in vivo. 
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Postmeiotic spermatids use a unique strategy to coordinate gene expression with morphological 
transformation, in which transcription and translation take place at separate developmental stages, 
but how mRNAs stored as translationally inert messenger ribonucleoproteins in developing spermatids 
become activated remains largely unknown. Here, we report that the RNA binding protein FXR1, a 
member of the fragile X-related (FXR) family, is highly expressed in late spermatids and undergoes 
liquid-liquid phase separation (LLPS) to merge messenger ribonucleoprotein granules with the 
translation machinery to convert stored mRNAs into a translationally activated state. Germline-specific 
Fxr1 ablation in mice impaired the translation of target mRNAs and caused defective spermatid 


development and male infertility, and a phase separation—deficient FXR 


1351P mutation in Fxr1 knock-in 


mice produced the same developmental defect. These findings uncover a mechanism for translational 
reprogramming with LLPS as a key driver in spermiogenesis. 


permiogenesis (postmeiotic male germ 
cell development) in mammals is a high- 
ly orchestrated process during which 
haploid spermatids undergo sequential 
programmed transitions, including nu- 
clear condensation and elongation, forma- 
tion of flagella and acrosome, and elimination 
of cytoplasm, eventually transforming into 
spermatozoa (7). Such a substantial differen- 
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tiation process is strictly controlled by a group 
of spatially and temporally regulated genes 
collectively referred to as spermiogenic genes. 
Because nuclear condensation during spermatid 
elongation gradually halts transcription, sper- 
miogenic genes are transcribed early and then 
stored as translationally repressed messenger 
ribonucleoproteins (mRNPs) until they are 
needed for translation (2, 3). Although this 
uncoupling between transcription and transla- 
tion provides an elegant mechanism for the 
temporal control of gene expression during 
spermiogenesis, how the stored mRNAs are 
converted into a translationally activated state 
in developing spermatids has remained large- 
ly unclear. 

Recent studies suggest that biomolecular con- 
densates are widely present as non-membrane- 
bound subcellular compartments in eukaryotic 
cells, which likely enrich proteins and/or nu- 
cleic acids to achieve spatiotemporal control 
in specific cellular processes (4, 5). Although 
many proposed condensates remained to be 
fully characterized, one well-documented ex- 
ample is RNP granules, in which RNAs bound 
by RNA binding proteins (RBPs) are thought 
to self-organize into membraneless organelles 
(6). This process, known as liquid-liquid phase 
separation (LLPS), has been attributed to in- 
trinsically disordered regions (IDRs) of RBPs 
(7-9), and increasing evidence suggests that 
multivalent interactions between RNA mole- 
cules also contribute to RNP assembly (70). In 
developing germ cells, RNP granules are better 
known as germ granules. They are prominently 
present in the cytoplasm and store nontranslat- 
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ing mRNAs (J7). For example, the chromatoid 
body (CB), the largest RNP granule in mam- 
malian male germ cells, stores >800 mRNAs in 
postmeiotic spermatids in mice (12). However, 
as with other inert mRNPs, little is known about 
how these mRNAs stored in germ granules are 
reactivated during germ cell development. 

FXR1 belongs to the fragile X-related (FXR) 
family consisting of FMR1/FMRP, FXRI, and 
FXR2, which are canonical RBPs containing 
two KH domains and an RGG box (13). FXR 
family proteins have been primarily docu- 
mented as translation suppressors that trans- 
port target mRNAs to different intracellular 
compartments (/4). In particular, FMR1/FMRP 
is also known to repress local translation at syn- 
apses, and defects in this critical translational 
control step are responsible for the patho- 
physiology of fragile X syndrome (15). FXR1 
has also been found in translationally inert 
mRNP granules (J6), where it may be respon- 
sible for destabilizing certain target mRNAs 
in adult neural stem cells (77). Additionally, 
FMR1/FMRP has been reported to enhance 
the translation of mRNAs stored in Drosophila 
oocytes (78). Here, we found that the FXR1 
protein undergoes LLPS to merge mRNP gran- 
ules with translational machinery to activate 
the translation of the stored mRNAs during 
spermiogenesis. 


Results 
FXR1 is a potential translational regulator during 
late spermiogenesis 


To understand how translationally inert mRNAs 
become activated during late spermiogenesis, 
we performed quantitative proteomic analysis 
to identify potential translational regulators in 
polysomes, which are active in protein synthe- 
sis (19), in mouse late spermatids (LS), which 
include elongating and elongated spermatids. 
Because polysomes in purified male germ cells 
collapsed, we prepared polysome fractions 
from mouse testes at two developmental 
time points: 25 days postpartum (dpp), which 
contain round spermatids (RS) but lack LS, 
and 35 dpp, which contain both RS and LS 
(Fig. 1A). Using proteomic analysis, we iden- 
tified 50 proteins in testicular polysomes 
annotated to translation regulation by Gene 
Ontology (GO) analysis, among which 12 pro- 
teins were significantly elevated in polysomes 
from 35- relative to 25-dpp testes (Fig. 1B and 
data S1). TARBP2, a translation activator for 
Protamine mRNA during late spermiogenesis 
(20), was more enriched in the 35-dpp tes- 
ticular polysomes (Fig. 1B and data S1), thus 
validating our screening strategy. 

We focused on FXR1 given the previously 
documented roles of its paralog, FMR1/FMRP, 
in regulating translation (/4, 18). The enrich- 
ment of FXR1 protein in 35- over 25-dpp testes 
(Fig. 1B) led us to hypothesize that FXR1 is in- 
volved in translation regulation in LS. Sucrose 
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Fig. 1. Screening potential translational regulators in mouse testes. 

(A) Schematic illustrating the experimental design for identifying translational 
regulators in mouse testes by polysome profiling coupled with mass 
spectrometry. (B) Identification of unique translational regulators in LS. Left, 
scatter plot showing the levels of all detected translational regulators in 35-dpp 
testicular polysomes relative to a 25-dpp control with two replicates, with 
the up-regulated (Up), down-regulated (Down), and unchanged proteins (NS) 


gradient analysis of polysome profiles showed 
that FXR1 level was elevated in the polysome 
fractions from 35- relative to 25-dpp testes, 
and puromycin treatment reduced FXR1 pro- 
tein present in the heavy polysome fractions 
from 35-dpp testes (Fig. 1C and fig. S1A). 
These results indicate that FXR1 is much more 
associated with polysomes in adult mouse 
testes. Fvrl was much more expressed in 
testes compared with other tissues from adult 
mice (fig. SIB), and, moreover, Pwr] expression 
in testes increased as the mice aged (fig. S1C) 
and FxrlI was expressed more highly in LS 
than in spermatocytes (SC) and RS (fig. S1D). 
Together, these results suggest FXR1 as a po- 
tential translational regulator in LS. 


FXRI1 binds translationally active mRNAs in 
mouse LS 


To determine whether FXRI1 regulates trans- 
lation during late spermiogenesis, we identi- 
fied FXR1 target mRNAs using eCLIP-seq (21) 
and found 4272 FXR1-bound mRNAs in 35-dpp 
testes (data $2). Like FMR1/FMRP (22), FXR1 
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binds predominately within coding regions of 
target mRNAs (~70%; fig. S2, A and B), and we 
did not detect consensus sequence motifs in 
FXR1-binding sites. GO analysis of FXxR1-bound 
mRNAs revealed a particular enrichment in 
spermatid differentiation and development, 
including flagellum formation, chromatin re- 
modeling, and sperm motility (fig. S2C). 
Many FXR1 targets have established roles in 
postmeiotic phase of male germ cell develop- 
ment, including Gpd2 (23), Akap3 (24), Hk1 
(25), Prkar2a (26), Stat4 (27), Cap2 (28), and 
Cylcl (29). RNA coimmunoprecipitation in 
combination with quantitative polymerase 
chain reaction (RIP-qPCR) assays showed sig- 
nificant enrichment of these seven mRNAs in 
anti-FXR1 immunoprecipitates in adult mouse 
testes (Fig. 2A, left). These seven mRNAs were 
significantly associated with FXR1 protein in 
polysome fractions from adult testes (Fig. 2A, 
right), suggesting that FXR1 is involved in 
their translation. RS and LS of adult mouse 
testes contained similar levels of each mRNA, 
but LS contained higher levels of proteins 
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indicated in red, blue, and gray dots, respectively. Horizontal dashed line 
indicates the FDR cutoff at 0.05; two vertical dashed lines indicate the 
fold-change cutoff at 1.5; right, label-free quantitation (LFQ) intensity of 
FXR1 in 35- and 25-dpp testicular polysomes. (C) Quantification of FXR1in 
sucrose gradient fractions from 25- or 35-dpp mouse testes with or without 
puromycin (Puro) treatment, with RPL7a serving as a reference (n = 3; data 
are shown as mean + SEM). 


(Fig. 2B), suggesting that these candidate FXR1 
targets are transcribed in earlier stages of male 
germ cells but are translated in LS. Immunos- 
taining of testis sections confirmed that GPD2, 
AKAP3, HK1, and STAT4 proteins were mainly 
translated in LS (Fig. 2C). These results together 
suggest an involvement of FXR1 in translation 
activation during late spermiogenesis. 


FXRI1 depletion impairs translation of a subset of 
genes in adult mouse testes 


To study the function of FXR1 in translation 
activation in LS, we generated germline- 
specific Fr] knockout mice (referred to as Fert ; 
fig. S3, A and B). GPD2, AKAP3, HK1, and 
STAT4 protein levels were reduced in Fert° 
LS (Fig. 2D, left), but their mRNA levels were 
not changed (Fig. 2D, right). Immunostaining 
on testis sections confirmed their significant 
reduction in FxrI“° LS (Fig. 2E). Consistently, 
sucrose gradient analysis showed that all tested 
target mRNAs were reduced in the polysome 
fractions, with a concomitant increase in lighter 
RNP and 40S and 808 fractions from Feri° 
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Fig. 2. FXR1 is required for the translation activation of a large group of 
genes in mouse testes. (A) RIP combined with RT-qPCR analyses of FXR1 binding 
to seven potential targets in adult mouse testes (left) or polysomes from adult 
mouse testes (right). (B) RT-qPCR (left) and Western blotting analyses (right) of 
FXR1 target expression in enriched RS and LS. (€ and E) Immunostaining of FXR1 
target expression (red) in testis sections from adult wild-type (C) and Fxr1°° 

or Fxr1"* control mice (E), with an enlargement of white framed region on the 
right. Nuclei and acrosomes are counterstained with DAPI (blue) and PNA 
(green), respectively. Scale bar, 10 um. (D) Western blotting (left) and RT-qPCR 
analyses (right) of FXR1 target expression in Fxr1°*° and control LS. (F) Sucrose 
gradient combined with RT-qPCR analyses of FXR1 target mRNAs from adult 
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(H) are representative of three and two independent experiments, respectively. 
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testes (Fig. 2F). As expected, puromycin treat- 
ment led to the reduction of all tested mRNAs 
in the polysome fractions (Fig. 2F and fig. S3C). 
By contrast, Fri deletion did not change the 
polysome-to-monosome ratio in adult testes 
(fig. S3C), indicating that FXR1 is not essential 
for global mRNA translation in mouse testes. 

We assessed how many genes might be 
subjected to activation by FXR1 in mouse 
testes. Because of the poor quality of ribosome 
profiling (ribo-seq) libraries generated in adult 
mouse testes, we performed polysome profil- 
ing in parallel with RNA sequencing (RNA- 
seq) and transcriptome sequencing (fig. S3, 
D and E) and identified 1207 mRNAs that 
showed significantly reduced translational 
efficiency (polysome-associated mRNA over 
total) in adult Fr. ‘1° testes relative to controls 
(Fig. 2G and data S3). These results suggest 
that FXR1 is involved in regulating the trans- 
lation of a large group of mRNAs in mouse 
testes. GO analysis showed that many affected 
genes were involved in spermatid differentia- 
tion and development (fig. S3F). All four vali- 
dated FXR1 targets, Gpd2, Akap3, Hk1, and 
Stat4, showed significantly reduced transla- 
tional efficiency in Fxr/-null testes (Fig. 2G and 
data S3). Cross analysis revealed that 770 of 
those 1207 mRNAs with reduced translational 
efficiency in Fxvri-null testes were directly 
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bound by FXRI (Fig. 2H), suggesting that they 
are direct FXR1 targets. More than 85% of 
FXR1-activated target mRNAs were not en- 
riched in the CB granule (fig. S3G), implying 
that FXR1 likely organizes a distinct RNP 
platform to regulate spermiogenic mRNA 
translation. Moreover, FXR1 target mRNAs 
are significantly longer, especially their coding 
regions (fig. S3H), implying that FXR1 pref- 
erentially promotes the synthesis of large 
proteins. These results indicate that FXR1 is 
involved in translation activation of a subset 
of mRNAs in mouse testes. 


FXR1 is essential for spermiogenesis and male 
fertility in mice 

Despite the fact that FxrI**° male mice had 
normal size testes and epididymides (fig. S4A), 
they were sterile (Fig. 3A), whereas control 
Fart“ males were fertile. Frri“° testes had 
normal numbers of RS and elongating sper- 
matids but reduced condensed spermatids 
at steps 14 to 16 (stages II to VIII) in the 
seminiferous tubules (Fig. 3, B and C), con- 
comitant with apoptotic signals detected in LS 
(Fig. 3D). Consistently, transmission electron 
microscopy (TEM) of seminiferous epithelium 
revealed extensive defects in Fxr1°° LS (fig. 
S4B). Moreover, we observed few spermatozoa 


in Feri epididymides (Fig. 3E and fig. S4C). 


B Stage: 


These results demonstrated the functional im- 
portance of FXR1 in spermatid development 
and male fertility in mice. 


FXRI recruits translation factors to FXR1 
granules in LS 


To understand the mechanism underlying 
FXR1-mediated translation activation, we 
next performed immunoprecipitation (IP) 
coupled with mass spectrometry to identify 
the potential cofactor(s) of FXR1 in mouse 
testes. A total of 125 proteins were significantly 
enriched in anti-FXR1 IP pellets (fig. SoA and 
data $4), with a GO term enrichment in pos- 
itive regulation of translation (fig. S5B). Among 
them was the eukaryotic translation initiation 
factor 4 gamma 3 (EIF4G3) (fig. S5C), a scaf- 
fold protein that organizes the assembly of the 
EIF4F complex onto mRNAs for the initiation 
of cap-dependent translation (30, 37). The 
mouse genome encodes three EIF4G members, 
but only EIF4G3 has been shown essential for 
male fertility (32). We confirmed the interac- 
tion between FXR1 and EIF4G3 by co-IP in both 
mouse testes and cotransfected 293T cells 
(Fig. 4, A and B) and found their interaction to 
be resistant to RNase A treatment, suggesting 
a direct interaction between them. RIP-qPCR 
showed that all tested FXR1-activated targets 
were substantially reduced in anti-EIF4G3 


Fig. 3. Germline-specific depletion of FXR1 causes male infertility in mice. 
(A) All tested Fxrl°° males were infertile. (B) H&E staining of paraffin-embedded 
testis sections from adult control Fxrl"*t and Fxr1°"° mice. The stages of seminiferous 
epithelium cycles were determined by the morphology of spermatocytes and 
spermatids. Pl, preleptotene; L, leptotene; Z, zygotene; PS, pachytene spermatocyte; 
SS, secondary spermatocyte; ES, elongating spermatids. Scale bar, 20 jum. 

(C) Comparison of the ratios of postmeiotic spermatids versus Sertoli cells in 
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tubule cross sections of specific stages of seminiferous epithelial cycles and 
corresponding spermatid development steps from adult control and Fxrl°*° mice. 
(D) Terminal deoxynucleotidy| transferase-mediated deoxyuridine triphosphate 
nick end labeling (TUNEL) assay of testis sections from adult control and Fxr1°*° 
mice. Scale bar, 10 um. (E) H&E staining of paraffin-embedded cauda epididymis 
sections from adult control and Fxr1°"° mice. Scale bar, 100 um. Results of 
three independent experiments are shown. 
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immunoprecipitates from Frri°*° testes 
compared with controls (Fig. 4C), indicating 
that FXR1 is required for EIF4G3 to efficiently 
bind to target mRNAs. Despite little interac- 
tion between FXR1 and EIF4G1 or EIF4G2 in 
mouse testes (Fig. 4A and data S4), both 
EIF4G1 and EIF4G2 interacted with FXR1 
under overexpression conditions in cotrans- 
fected 293T cells (fig. S5D). The specificity 
of the FXR1-EIF4G3 interaction in mouse 
testes can likely be attributed to their sim- 
ilar expression pattern during spermatogenesis 
(fig. S5E). These findings point to a role of 
EIF4G3 in FXR1-mediated translation activa- 
tion in LS. 

Given that FXR1 targets are translationally 
activated primarily in LS (Fig. 2), we next 
investigated whether the FXR1-EIF4G3 in- 
teraction is differentially regulated during 
spermatogenesis in mice. The FXR1-EIF4G3 
interaction was detected in SC and RS but 
became more evident in LS because of their 
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Fig. 4. FXR1 interacts with EIF4G3 in mouse testes. (A, B, and D) Co-IP 
assay of FXR1-EIF4G3 interaction in mouse testes (A), Myc-FXRla- and Flag- 
EIF4G3-cotransfected 293T cells (B), or enriched SC, RS, and LS (D). (€) RIP 
combined with RT-qPCR analyses of EIF4G3 binding to the four FXRI1 targets in 
adult Fxr1°"° testes relative to controls. (E) Immunostaining of FXR1 (green) 
and EIF4G3 (red) in mouse male germ cells isolated from adult testes, with 

ei counterstained 


with DAPI (blue). Middle, 
and LS. Right, line plot of 


FXRI1 colocalization and re 


shown. Significant differen 


elevated expression there (Fig. 4D). FXR1 and 
EIF4G3 were largely present throughout the 
cytoplasm of SC and RS but preferentially 
formed abundant, distinct condensates that 
became more colocalized in LS (Fig. 4E). Given 
the markedly elevated FXR1 and EIF4G3 pro- 
teins in LS (Fig. 4, D and E), the formation of 
condensates may be driven by a concentration- 
dependent process characteristic of LLPS. 

In addition to EIF4G3, several other trans- 
lation factors, including poly(A) tail binding 
protein C1 (PABPC1), EIF4A3, and ribosome 
components, were enriched in FXR1 complexes 
from adult mouse testes (fig. S5C and data S4). 
PABPC1 has been reported to be actively in- 
volved in translation during spermatogenesis 
(33), likely facilitating translation initiation 
through association with eIF4G (34). Indeed, 
PABPC1 colocalized with FXR1 in SC and RS 
and was recruited to FXR1 condensates in LS 
(fig. SSF). Moreover, the large ribosomal sub- 


unit protein RPL7a was in close proximity to 
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FXR1-containing granules in LS (fig. S5G). 
These results suggest that FXR1 orchestrates 
translation activation of its target mRNAs by 
recruiting multiple components of the transla- 
tion machinery. 


FXR1 undergoes phase separation to form mRNP 
granules in LS 


The observation of FXR1-containing conden- 
sates in LS suggests its distinct functions in 
early versus later stages of spermiogenesis 
(Fig. 4E and fig. $5, F and G). FMR1/FMRP, 
another FXR family protein, can form bio- 
molecular condensates through LLPS (35). 
We investigated whether FXR1 also undergoes 
phase separation to form condensates in mouse 
LS. FxrlI encodes seven isoforms, FXRla to 
FXRig, through alternative splicing (36), and 
all of these contain IDRs at the C terminus (fig. 
S6A). We found all seven Fxr! transcript iso- 
forms in mouse testes (data $5). FXRla (a 
shorter form) and FXRid (a longer form) 
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proteins appeared to be relatively more abun- 
dant in LS (fig. S6C). Purified FXR1 isoforms 
formed spherical droplets in vitro (Fig. 5A and 
fig. S6, D and E). After photobleaching, FXRla 
droplets recovered most fluorescence signals 
within ~180 s in fluorescence recovery after 
photobleaching (FRAP) analysis in vitro (Fig. 
5B). Moreover, all FXR1 isoforms formed 
round condensates that exhibited fusion be- 
havior in C2C12 cells (Fig. 5C and fig. S6F), 
whereas FXR1la condensates recovered most 
fluorescence signals within ~120 s after photo- 
bleaching (Fig. 5D). Given that the diameters 
of FXR1 droplets/granules (1.0 to 5.0 um) were 


much larger than the resolution of confocal 
microscopy (~120 nm), the captured FXR1 
droplets and/or granules may represent their 
real structures or features (37). These results 
demonstrated that FXR1 proteins could undergo 
LLPS and that FXRI1 condensates display liquid- 
like features in vitro and in cellulo. In addition, 
we estimated the cytoplasmic concentration of 
FXRI proteins to be ~4 1M in RS and ~23 uM in 
LS (fig. $7), corresponding to abundant, distinct 
FXR1 condensates in LS but not in RS (Fig. 4E 
and fig. S5, F and G). 

To determine whether FXR1 condensates 
function to assemble target mRNAs as mRNP 
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Fig. 5. FXR1 forms mRNP granules with its target mRNAs through LLPS. 
(A) LLPS of purified recombinant FXRla at various amounts and with increasing 
concentrations of NaCl under 10% PEG. Left, summary of phase-separation 
behavior of FXR1a; right, representative fluorescent microscopy images. 

Scale bar, 5 um. (B) FRAP analysis of FXRla droplets in vitro. Scale bar, 2 um. 
(C) Live-cell images showing the fusion of smaller EGFP-FXR1a granules 
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granules, we mixed Alexa Fluor 488-labeled- 
FXRia protein with Cy3-labeled-Gpd2 mRNA. 
FXRI droplets enriched target mRNA Gpd2 
in vitro, and the addition of Gpd2 promoted 
FXRI1 LLPS in a dosage-dependent manner 
(Fig. 5E). The FXR1 target mRNAs Gpd2 and 
Akap3 were dispersed in the cytoplasm and 
were significantly colocalized with FXR1 but 
not with DDX4 (a CB marker) in RS, and then 
became accumulated into FXR1 condensates 
in LS (fig. S8A). Conversely, the MIWI target 
Tbpll was highly colocalized with DDX4 but 
not with FXR1. Moreover, biotin-labeled Gpd2 
or Akap3 antisense pull-down showed that 


in vitro FRAP: recombinant 


Eh 60 9 120 150 180 
Recovery time (s) 


in cetiuo FRAP: over-expressed 


Merge Merge 


into larger spherical granules in transfected C2C12 cells. Scale bar, 10 um. 
(D) FRAP analysis of EGFP-FXR1a in transfected C2C12 cells. Scale bar, 

2 um. (E) LLPS of purified recombinant FXRla at various amounts and 
with increasing concentrations of Gpd2 mRNA (~1000 nucleotides) in 

150 mM NaCl without PEG. Scale bar, 5 um. Results of three independent 
experiments are shown. 
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FXRI granules contained different target mRNAs 
(fig. S8B). These results support that FXR1 and 
target mRNAs form mRNP granules in LS. 


FXR1 phase separation is essential for 
translation activation of its targets 


We used a TRICK (translating RNA imaging 
by coat protein knock-off) reporter system as 
described previously (38, 39) to determine 
whether FXRI granules are required for the 
translation of its targets (fig. S9A). As expected, 
cotransfection of NLS-PCP-GFP [a PP7 coat 
protein (PCP) fused to a nuclear localization 
sequence (NLS) and green fluorescent protein 
(GFP)] and NLS-MCP-RFP [a MS2 coat pro- 
tein (MCP) fused to an NLS and RFP] resulted 
in two-color labeling of nuclear Gpd2 mRNA 
in transfected C2C12 cells (Fig. 6A, panel I), 
indicative of the untranslated state. Coexpres- 
sion of blue fluorescent protein (BFP)-tagged 
FXRia yielded a fraction of Gpd2 into FXRla 
granules (blue) in the cytoplasm (Fig. 6A, 
panels II and III). Gpd2 mRNA in FXRla 
granules was labeled with RFP but not GFP, 
whereas puromycin treatment resulted in 
labeling of the mRNA in FXRla granules by 
both GFP and RFP. These results demonstrate 
that FXRla recruits target mRNA into FXRla 
granules to trigger its translation. 

We mapped the leucine residue at pos- 
ition 351 in the KH2 domain of FXRla as an 
essential residue for phase separation (fig. S9, 
B to D). However, the FXRla*” mutant main- 
tained normal RNA-binding capability (fig. S9, 
E and F), whereas a L351P mutation impaired 
FXRI phase separation likely by altering the 
hydrophobic pocket and conformation of Y353 
and Y357 in the KH2 domain (fig. S9, G and H). 
As expected, coexpression of BFP-FXRla”” 
led to the recruitment of Gpd2 mRNA in the 
cytoplasm, but the mRNA was untranslated, 
because it was labeled with both GFP and RFP 
with or without puromycin treatment (Fig. 6A, 
panels IV and V). These results indicate that 
FXRia™”” can recruit target mRNA to the cyto- 
plasm but cannot activate its translation, sup- 
porting that FXRI1 relies on its phase-separation 
ability to activate target translation. Support- 
ing this, the addition of FUS N-terminal IDR 
region (IDR*”S) to the FXRia!**!” mutant 
(FXRia’””-IDR'S mutant) resulted in the for- 
mation of FXRla/Gpd2 granules in the cyto- 
plasm and active translation of Gpd2 mRNA 
in the granules (Fig. 6A, panels VI and VII). 
Western blot also confirmed that FXRia’°!” 
lost the stimulatory effect on Gpd2 translation 
considerably, whereas its phase separation- 
restored mutant rescued GPD2 expression 
(Fig. 6B). These data support that phase sep- 
aration is essential for FXR1-mediated target 
translation. 

To further investigate whether phase sepa- 
ration is critical for FXR1 function in trans- 
lation activation in mouse spermatids, we 


Kang et al., Science 377, eabj6647 (2022) 


constructed FXR1-internal ribosome entry 
site (IRES)-cytochrome oxidase IV N-terminal 
(Cyto IV)-fused enhanced GFP (EGFP) lentiviral 
vectors (fig. S9I) to ectopically express wild-type 
FXR1, FXR1*?°!”, or FXRID**!?-IDRIUS in 
Far°*° testes (40). We found that wild-type 
FXRla, FXRid, or their FXRI°!”-1DRFUS 
mutants, but not FXR1'**? mutants, rescued 
FXR1 granules (Fig. 6C) in Fvri-null LS and 
protein expression of target genes without 
altering their mRNA levels (Fig. 6D). 


FXR1 phase separation is required for sperm 
formation in mice 

We found that the ectopic FXR1 protein res- 
cued sperm formation in transduced FurI° 
mice. The coexpressed Cyto IV-tagged EGFP 
anchors to the mitochondria of sperm and 
thus allowed us to track the spermatozoa 
progressed from transduced spermatids (fig. 
S91). As shown by differential interference 
contrast microscopy, spermatozoa were ab- 
sent in epididymides from untransduced or 
control Cyto IV-EGFP-transduced FrrI‘*° 
mice, whereas spermatozoa were detectable 
in epididymides from wild-type FXRla, FXRid, 
or their FXR1**"?-IDR'™S mutant-transduced 
mice, but not FXR1'**'?-transduced mice (Fig. 
6E). As expected, all of these spermatozoa 
were GFP", indicating that they had progressed 
from the transduced Frr1*° spermatids. These 
spermatozoa fertilized oocytes in intracyto- 
plasmic sperm injection to produce blastocysts 
(Fig. 6F), indicating that they maintain the 
genome integrity required to support embry- 
onic development. Moreover, similar to the 
active controls (movie S1), all GFP* spermato- 
zoa swam actively (movies S2 to S5), suggest- 
ing that they are functional spermatozoa. We 
conclude that phase separation of FXR1 is 
functionally required for spermatid develop- 
ment in mice. 


FXR1 phase separation is indispensable to male 
fertility in mice 

We generated germline-specific Fur1’?*!? 
knock-in (referred to as Fert“) mice (fig. S10, 
A and B), and found that translation of tested 
FXRI1 targets was reduced in FerI™ LS (fig. 
$10C). Far“ males were sterile (fig. S10D) 
with condensed spermatids at steps 15 and 
16 (stages IV to VIII) that were substantially 
reduced in the seminiferous tubules, apoptotic 
signals were detected in LS (fig. S10, E to 
G), and spermatozoa were barely present in 
epididymides (fig. S10, H and I). These results 
show that Frr1™ males highly phenocopy 
Far1°*° mice, supporting the functional re- 
quirement of FXR1 phase separation for sper- 
matid development and male fertility in mice. 


Discussion 


Our results uncovered FXR1 as a key transla- 
tion activator in LS and identified a large 
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group of spermiogenic genes as likely being 
direct FXR1-activated targets. Our mechanistic 
studies revealed that the FXR1 protein under- 
goes LLPS to form mRNP granules and recruit 
the translational machinery to translationally 
activate stored mRNAs within them (Fig. 6G). 
In RS, where FXR1 is expressed at a low level, 
the newly transcribed FXR1 target mRNAs 
may be assembled into translationally inert 
mRNPs by association with a low level of FXR1 
and other cytoplasmic RBPs and/or intermole- 
cular RNA-RNA interactions. In LS, elevated 
FXRI protein may “invade” these mRNPs using 
phase separation to gradually disrupt existing 
RNA-RNA interactions and/or replace other 
RBPs to transform the stored mRNAs into new 
FXR1-dominated mRNP granules. During this 
transformation process, FXR1 also recruits the 
translation machinery through its direct in- 
teraction with EIF4G3, thereby converting its 
target mRNAs into a translationally activated 
state. The proposed mechanism suggests a 
key role of FXR1 granules in enriching both 
mRNA transcripts and the translational ma- 
chinery, whereby it induces the translation of 
spermiogenic mRNAs in LS. 

FXR1 is required for muscle development in 
mice (47). Similar to our observations, a recent 
study showed that FXR1 also formed bio- 
molecular condensates in muscle cells (42), 
but the function of FXR1 phase separation in 
translational control and muscle development 
remains to be determined. Recessive mutations 
in exon 15 of FXR1 are associated with congen- 
ital multi-minicore myopathy, and a patient 
carrying such mutations also showed hypo- 
plastic genitalia and cryptorchidism (43). These 
findings suggest that defective FXR1 functions 
in humans might induce abnormal development 
in both muscles and the male genital system. 

In contrast to the well-documented roles 
of FXR family members in translational rep- 
ression, FXR1 enhances translation in mouse 
spermatids. Similar to our observations, an 
early report showed a positive role of FXR1 in 
Ago2 and/or miRNA-mediated translation 
activation in quiescent conditions (44). In addi- 
tion, a recent study report that FXR1 promotes 
cMYC translation in cancer cells by recruiting 
the eIF4F complex to the translation start site 
(45). Our finding is also reminiscent of a recent 
study in Drosophila oocytes, in which FMR1/ 
FMRP preferentially stimulated the translation 
of a large set of stored mRNAs (/8). Given the 
lack of binding specificity for either FXR1 or 
FMR1/FMRP, we suspect that FMR1/FMRP 
may use a similar phase-separation mechanism 
to activate its target mRNAs in Drosophila 
oocytes. Thus, the current findings suggest a 
multifaceted function of FXR family proteins, 
likely in a context-dependent manner, provid- 
ing important implications for the function of 
FXR family members in critical development 
and disease processes. 
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Fig. 6. Phase separation A 
is essential for FXR1 
function in translation 
activation and spermatid 
development in mice. 

(A) Live-cell imaging 
analysis of the effect 

of wild-type FXR1a, 
FXR1a‘S5!P or FXR1al35!P- 
IDRFUS mutants on 

Gpd2 translation using the 
TRICK reporter system 

in cotransfected C2C12 
cells. Left, representative 
live-cell images, with the 
nucleus and FXR1 granules 
indicated by dotted lines D 
and arrows, respectively. 
Scale bar, 2 um. Right, 
schematic illustrating the 
translation state of 
Gpd2. (B) Western blotting 

of GPD2 expression in the 
indicated cotransfected 

C2C12 cells, with B-actin 
serving as an internal 

control. (C) Immunostain- 

ing of FXR1 (red) in mouse 
spermatids isolated from 

the indicated mouse 

testes. Nuclei are counter- 
stained with DAPI (blue). 

EGFP (green) indicates the E 
transduced cells, with 
epresentative enlarged 

views of white framed LS 
shown on the right. Scale " 
bar, 10 um. (D) Western 
blotting (left) and 

RT-qPCR analyses (right) 

of FXR1 target expression 

in EGFP* spermatids 

sorted from transduced 

Fxr1°"° testes. (E) Differ- 

ential interference contrast F 
microscopy images of 
epididymal sperm from 
indicated lentiviral expres- 
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maintained the genome integrity required to fertilize oocytes and support embryonic development by intracytoplasmic sperm injection assay. (G) Schematic model showing that 
FXRI1 activates the translation of spermiogenic mRNAs in LS through phase separation and by interacting with components of translation machinery. Results shown are 
representative of three independent experiments. Significant differences by Student's t test are marked with asterisks: *P < 0.05, **P < 0.01, ***P < 0.001, n.s., not significant. 


Materials and Methods 

Mice 

All animal studies were approved by the 
Shanghai Institute of Biochemistry and Cell 
Biology (SIBCB) Institutional Animal Care and 
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Research Advisory Committee. All mouse lines 
used in this study were housed under specific 
pathogen-free conditions. FvrI"/"™* mice 
were obtained from D. L. Nelson’s laboratory at 
Baylor College of Medicine (41), and Stra8-Cre 
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mice were obtained from M. Tong’s labo- 
ratory at SIBCB (46). The germline-specific 
Far! knockout mice (referred to as FxrI*°) 
were generated by crossing FrrI°/" mice 
to Stra8-Cre mice (fig. S3A). The N-terminal 
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Flag-tagged Fxrl knock-in mice and Ferri'?*"” 
knock-in mutant mice were generated in the 
present study using the CRISPR-Cas9 system 
combined with semi-cloning technology by 
Genome Tagging Project (GTP) Center, SIBCB 
(figs. S6B and S10A). The germline-specific 
Fert" knock-in mutant mice (referred to 
as Fort) were generated with germline- 
specific FxrI knockout mice by Stra8-Cre- 
mediated recombination (fig. SIOB). Four 
Fert, Fert, or control Ferri“ male mice 
from two litters were scored in the fertility 
study, with 8 to 12 months mating continued. 
Briefly, RxrI“°, Fxr1“, or control male mice at 
3 months of age were mated to 2-month-old 
wild-type C57BL/6 female mice. The female 
was replaced with a 2-month-old female mouse 
every 2 to 3 months. Cages were monitored daily 
for the birth and number of newborn pups. 


Cell culture and transfections 


The mouse myoblast cell line C2C12 was pur- 
chased from the cell bank of the Type Culture 
Collection of Chinese Academy of Science 
(TCCCAS). Cells were cultured in Dulbecco’s 
modified Eagle’s medium (DMEM) with 10% 
fetal bovine serum (FBS). Transfections were 
performed with Lipofectamine 3000 (Invitrogen) 
according to the manufacturer’s instructions. 
All DNA plasmids were cleared of endotoxin. 
For live-cell imaging experiment, 1.5 ug of plas- 
mids of GFP-tagged FXR1 or mutants were 
transfected in a 35-mm dish with a glass-like 
polymer bottom (Cellvis, D35-20-1.5P). For 
the TRICK reporter assay, 1.0 ug of GFP-NLS- 
PCP expression vector, 1.0 ug of RFP-NLS- 
MCP expression vector, 1.5 ug of BFP-FXRla 
or its mutant expression vector, and 1.5 ug of 
Gpd2-TRICK vector were cotransfected in C2C12 
cells in a 35-mm dish. For the protein interac- 
tion-mapping experiment, 1.5 ug of expression 
vectors for each protein were cotransfected in 
293T cells in each well of a six-well plate. 


Plasmids and RNA oligonucleotides 


The TRICK reporter pCMV-Gpd2-TRICK was 
derived from pMax-pona-12xTRICK-24«MS2SL 
(Addgene, catalog #64542), with the original 
promoter replaced with the CMV promoter 
and the pona gene replaced by a Gpd2-coding 
region. The promoters of pHsp83-NLS-HA- 
2xPCP-2xGFP (Addgene, catalog #71243) and 
PHsp83-NLS-HA-2xMCP-2xTagRFP-T (Addgene, 
catalog #71242) were replaced with the CMV 
promoter. pCMV-Myc-FXR1 and pCMV-EGFP- 
FXRI vectors were constructed by insertion 
of the coding region of the indicated mouse 
Feri isoforms into pCMV-Myc and pCMV- 
EGFP-C1 (Clontech), respectively. pCMV-BFP- 
FXRI were constructed by replacement of 
the EGFP of pCMV-EGFP-FXR1 with BFP. The 
pCMV-Flag-EIF4G isoform vectors were con- 
structed by the insertion of the coding regions 
of mouse EIF4G isoforms into pCMV-3xFlag 
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(Clontech). The lentiviral vectors pLV-Eflo- 
FXRI1-IRES-Cyto IV-EGFP were generated as 
we recently described (47, 48). The coding 
regions of mouse Fivrl isoforms were cloned 
into the pET51b vector for their expression in 
E. coli. All FXR1 mutants were generated 
using the Site-Directed Mutagenesis Kit (New 
England Biolabs). All constructs were verified 
by DNA sequencing. 


Protein expression and purification 


Bacterial expression and purification of pro- 
teins were performed as we described recent- 
ly (49). Recombinant FXR1 proteins were 
expressed in F. coli strain BL21. Cultures were 
grown in Luria broth medium at 37°C to an 
optical density at 600 nm (OD¢oo) of 0.6. Af- 
ter induction with 0.2 mM isopropyl-f-p- 
thiogalactopyranoside, the cells were grown 
at 18°C overnight, harvested by centrifugation 
at 4°C, and then lysed with a French press 
(JNBio) in buffer A (20 mM Tris-HCl, pH 8.0, 
500 mM imidazole, and 1 mM phenylmetha- 
nesulfonyl fluoride) for Strep-FXR1-10xHis 
proteins. Bacterial lysate was clarified by cen- 
trifugation at 16,000g for 1 hour. The superna- 
tant of recombinant FXR1 protein was loaded 
onto a HisTrap HP column (Cytiva). After ex- 
tensive washing with buffer A, the target 
protein was eluted in a buffer with a gradient 
of imidazole from 25 to 500 mM. Proteins were 
further loaded onto a HiTrap Q HP anion- 
exchange chromatography column (Cytiva). 
The target protein was eluted in a buffer with 
a gradient of NaCl from 100 mM to 1 M and 
dialyzed to the final buffer containing 500 mM 
NaCl and 20 mM HEPES, pH 7.5. Each protein 
was then labeled by Abberior STAR 488 (Sigma- 
Aldrich, catalog #61048) before the in vitro 
phase-separation assay. 


TRICK reporter assay 


TRICK reporter assay was performed as de- 
scribed previously (38). In brief, the Gpd2 
TRICK reporter was constructed by engi- 
neered PP7 binding sites (PBS) or MS2 bind- 
ing sites (MBS), the binding motifs for PP7 
coat protein (PCP) and MS2 coat protein 
(MCP) (39), into the coding and 3’ untranslated 
regions of Gpd2, respectively (fig. S9A). Be- 
cause translating ribosomes can displace RBP 
binding to the coding region, PCP targeted to 
the coding region through the inserted PBS 
motif thus can act as a biosensor to distin- 
guish translated mRNAs from untranslated 
ones (39) (fig. S9A). For the Gpd2 TRICK 
reporter assay, Gpd2 TRICK reporter was 
cotransfected with the expression vectors for 
GFP-NLS-PCP, RFP-NLS-MCP, and BFP-FXRla 
or its mutant in C2C12 cells. 


Electrophoretic mobility shift assay 


Before the electrophoresis, mRNA and pro- 
tein were prepared as below. About 1000 nu- 
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cleotides of Gpd2 mRNA was transcribed by 
T7 RNA polymerase (Promega, catalog #P1300) 
and purified using the MEGAclear kit (Thermo- 
Fisher, catalog #AM1908). Wild-type or mutant 
FXRI1 proteins were expressed and purified 
as described above. Gpd2 mRNA and FXR1 
proteins were then mixed at the indicated con- 
centrations in the binding buffer containing 
100 mM NaCl and 20 mM Tris-HCl, pH 7.5, and 
incubated on ice for 20 min. In the meantime, 
1% agarose gel (BIOWEST) containing 0.5x 
TBE buffer (45 mM Tris-Borate, 1 mM EDTA) 
supplied with 1x SYBR Gold (ThermoFisher, 
catalog #S11494) was prepared. The incubated 
mixture was then subjected to the gel and run 
in 0.5x TBE buffer at 5V/cm for ~40 min. After 
the electrophoresis, the gel was scanned using 
a multi-automatic fluorescence image analysis 
system (Tanon 5200 Multi). The gray intensity 
of free and FXR1-bound Gpd2 mRNA was 
measured with ImageJ, and GraphPad Prism 
9 software was used to fit the binding curve 
of each reaction with a nonlinear regression 
method based on the Hill equation 


1 
ae oe 

K; 
t+ (#) 


where 7 is the Hill coefficient. 


f= 


In vitro phase-separation assay 


The phase-separation assay was performed 
on a homemade glass slide covered with tape- 
sealed coverslips. The purified FXR1 protein 
was assembled by diluting the protein from a 
high-salt-containing storage buffer to a vary- 
ing concentration buffer containing 20 mM 
HEPES, pH 7.5, 75 to 500 mM NaCl, + 10% 
polyethylene glycol (PEG) at the indicated pro- 
tein concentration. For co-phase separation 
assay of FXR1 and mRNA, Gpd2 mRNA (200, 
1000, or 2000 nucleotides) was first transcribed 
by T7 RNA polymerase (Promega, catalog 
#P1300), with the incorporation of Cy3-labeled 
UTP (APExBio, catalog #B8330) into the reac- 
tion system, and purified with the MEGAclear 
kit (Thermo Fisher, catalog #AM1908). The 
FXR1 protein and Gpd2 mRNA“ were then 
mixed gently at the indicated concentrations 
and diluted from a high-salt-containing stor- 
age buffer to a physiological buffer containing 
20 mM HEPES, pH 7.5, and 150 mM NaCl. Af- 
ter incubation for 5 min, images were captured 
with a FV3000 confocal microscope (Olympus) 
equipped with a superresolution module. 


FRAP assay 


FRAP experiments were performed on a white 
light laser confocal microscopes (Leica TCS 
SP8 WLL) equipped with immersion objectives 
of 63x magnification driven by the imaging 
software provided by the manufacturers. FRAP 
was performed by photobleaching the region of 
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interest with a laser at 488 nm (five repeats, 
30% intensity, dwell time 1.5 s), and images 
were captured every 5 s. For FRAP in cultured 
cells, cells were plated on a 35-mm dish with 
a glass-like polymer bottom (Cellvis, catalog 
#D35-20-1.5P). Before image capture, phenol 
red-free DMEM (Gibco, catalog #21063-029) 
supplemented with 10% FBS was used in lieu 
of the cell culture. The images were also taken 
every 5 s after the region of interest was pho- 
tobleached. The fluorescence intensity of the 
bleaching region was analyzed and determined 
using the manufacturer’s software. The data 
were then normalized and plotted to the time 
point before the bleaching. 


IP, immunoblotting, and immunofluorescent 
microscopy 


For IP microscopy, mouse testicular tissue or 
the indicated cells were homogenized in ice- 
cold lysis buffer containing 50 mM Tris-HCl, 
pH 7.4, 1% Triton X-100, 150 mM NaCl, 5 mM 
EDTA, and protease inhibitor cocktail, and 
then the tissue or cell extracts were incubated 
with antibody-coupled Protein A/G beads 
(Sigma-Aldrich) according to the manufac- 
turer’s instructions. For immunoblotting, IP 
pellets or tissue or cell extracts were mixed 
with SDS loading buffer and subjected to the 
standard SDS-polyacrylamide gel electropho- 
resis and Western blotting procedures, and 
immunoblotting images were captured with a 
model 5200 chemiluminescent imaging sys- 
tem (Tanon). For immunofluorescent micros- 
copy, enriched male germ cells or testicular 
sections were fixed with 4% paraformalde- 
hyde, permeabilized with 0.5% Triton X-100 
in PBS, and then incubated with primary anti- 
bodies and Alexa Fluor 488- or Cy3-conjugated 
secondary antibodies and/or with peanut 
agglutinin (PNA) conjugated with Alexa Fluor 
488/647 (Molecular Probes), with nuclei coun- 
terstained with 4’,6-diamidino-2-phenylindole 
(DAPI; Vector Laboratories). Images were 
captured using an FV3000 confocal micro- 
scope (Olympus), and immunostaining den- 
sity was quantified using the FV31S-SW Viewer 
and ImageJ software. 


Histological, TEM, and apoptosis assays 


For histological assay, 5-um paraffin-embedded 
mouse testicular or epididymal sections were 
prepared as described previously (50) and then 
stained with hematoxylin and eosin (H&E). 
The stages of the seminiferous epithelium 
cycle and of spermatid development were 
determined as described previously (50). For 
TEM, testicular sections were fixed in 4% 
paraformaldehyde containing 0.05% glutar- 
aldehyde in 0.1 M phosphate buffer and then 
postfixed in 1% osmium tetroxide. Dehydra- 
tion was performed in ethanol, and the sam- 
ples were embedded in Epon 812. Ultrathin 
sections were counterstained with uranyl ace- 
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tate and lead citrate and examined with a TEM 
(FEI Tecnai G2 Spirit). Apoptosis analysis was 
performed using the In Situ Cell Death Detec- 
tion Kit, TMR red (Roche) according to the 
manufacturer’s instructions. 


Fluorescence in situ hybridization coupled with 
immunofluorescent assays 


In situ hybridization was performed with a 
fluorescence in situ hybridization (FISH) kit 
(RiboBio, Guangzhou, China). The isolated 
male germ cells were fixed with 4% para- 
formaldehyde for 10 min, followed by three 
washes with 1x PBS and incubation with 
prechilled permeabilization buffer containing 
0.5% Triton X-100 in PBS at 4°C for 5 min. 
After three washes with 1x PBS for 5 min, cells 
were incubated with prewarmed prehybrid- 
ization solution at 37°C for 30 min. The Gpd2, 
Akap3, or Topli antisense oligodeoxynucleotide 
probes (RiboBio) were then individually diluted 
in the hybridization solution and incubated 
with the cells at 37°C overnight in the dark. 
The next morning, the cells were washed with 
elution buffer I (4x SSC and 0.1% Tween-20) 
three times, elution buffer II (2x SSC) once, 
elution buffer III (1x SSC) once, and 1x PBS 
once for 5 min at 42°C. For an additional 
immunofluorescent assay, cells were rinsed 
with 1x PBS for 15 min and then subjected to 
immunofluorescent staining with anti-FXR1 
and anti-DDX4 antibodies at 4°C overnight. 
After three washes with 1x PBS, cells were 
incubated with Alexa Fluor 488-labeled anti- 
mouse and 647-labeled anti-rabbit secondary 
antibody and DAPI in blocking buffer con- 
taining 10 mg/ml bovine serum albumin (BSA), 
0.5% normal donkey serum (NDS), and 1x 
PBS at room temperature for 2 hours, and 
then washed with 1x PBS three times for 15 min 
each. Images were captured with an FV3000 
confocal microscope (Olympus). Immunos- 
taining density was quantified using the 
FV31S-SW Viewer and ImageJ software. 


RNA isolation, RT-qPCR, Absolute qPCR, 
and RIP assays 


The assays were performed as we described 
previously (47, 51). Briefly, total RNAs were 
extracted with RNAiso Plus (Takara) from the 
indicated cells, tissues, or polysome fractions, 
and reverse transcription (RT)-qPCR was per- 
formed using the PrimeScript RT and SYBR 
Green kits (Takara) on a QuantStudio 3 system 
(Applied Biosystems) according to the manu- 
facturer’s instructions. For absolute quantifi- 
cation of specific mRNA, a standard curve 
method was adopted. Plasmid containing the 
Fxri-coding region was linearized. The molec- 
ular number of plasmid was calculated accord- 
ing to the following formula: copy number 
(copies/u1L) = 6.0210” (copies/mol) x plasmid 
concentration (g/uL)/plasmid molecular weight 
(g/mol). The plasmid was then diluted into 
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eight gradients of ~10”° to 10? copies/1L. qPCR 
of standards and extracted total RNAs were 
performed using the PrimeScript RT and SYBR 
Green kits (Takara) on a QuantStudio 3 system 
(Applied Biosystems). The standard curve was 
plotted, and the absolute amount of specific 
mRNA in total RNA was extrapolated accord- 
ing to the standard curve. For the RIP assay, 
cells or tissues were homogenized in lysis buffer 
containing 100 mM KCl, 5 mM MgClo, 10 mM 
HEPES, 0.5% and NP-40 containing 10 U/ml 
RNase inhibitor (Takara) and a protease in- 
hibitor cocktail (Roche), and extracts were in- 
cubated with anti-FXR1 antibody or control 
immunoglobulin G (IgG)-coupled Protein A/G 
for 4 hours at 4°C. After the beads were strin- 
gently washed with washing buffer containing 
50 mM Tris-HCl, pH 7.4, 0.1% Triton X-100, 
500 mM NaCl, 5 mM EDTA, protease inhibitor 
cocktail, and RNase inhibitor, RNAs were ex- 
tracted from the beads with RNAiso Plus 
(Takara) and then subjected to the RT-qPCR 
assay. The primer sequences for RT-qPCR are 
provided in data S6. The results were first 
analyzed as relative mRNA levels of the cycle 
threshold (CT) values and then converted as 
fold changes. 


Sucrose gradient analysis 


Sucrose gradient analysis of testicular extracts 
was performed as we described recently (48). 
In brief, mouse testicular tissues were homog- 
enized in extraction buffer containing 100 mM 
NaCl, 50 mM Tris-HCl, pH 7.5, 5 mM MgCl, 
1% Triton X-100, and 100 ug/ml cycloheximide, 
and then subjected to centrifugation at 13,000g 
for 2 min to remove nuclear pellets. The ob- 
tained extracts were layered over the 10 to 
60% sucrose gradients and centrifuged at 
38,000 rpm on the SW41 rotor for 2 hours at 
4°C. For puromycin treatment, tissues were 
homogenized in extraction buffer and then 
incubated at 30°C for 30 min with 500 uM 
puromycin. Twelve fractions were collected 
by a Piston Gradient Fractionator (Biocomp, 
Fredericton, Canada), and each fraction was 
analyzed by Western blot and RT-qPCR for 
the indicated proteins and RNAs. 


Spermatogenic cell isolation 


RS and elongating spermatids were isolated 
from adult C57BL/6 mouse testes using the 
STA-PUT method described previously (52). 
Briefly, the testicular tissues were dissected 
from the albuginea membrane and then washed 
three times with DMEM (Gibco). Tissues were 
then digested with 1 mg/ml collagenase IV 
(Sigma-Aldrich) at 37°C for 10 min and cen- 
trifuged at 500g. The pellet was subsequently 
digested with 0.25% trypsin (Gibco) contain- 
ing 1 mg/ml DNase I (Sigma Aldrich) at 37°C 
for 10 min and centrifuged at 500g. The pellet 
was then resuspended in DMEM and filtered 
to prepare a single-cell suspension that was 
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further injected into the sample chamber of 
the STA-PUT apparatus (ProScience, Canada); 
a linear BSA gradient (2% BSA and 4% BSA in 
DMEM) was followed. After harvesting the cell 
fractions within 2.5 hours of sedimentation, 
RS and elongating spermatids were determined 
on the basis of their morphological character- 
istics and DAPI and PNA staining pattern under 
the microscope. 


Quantification of FXR1 in spermatids 


Cytoplasmic concentrations of FXR1 in sper- 
matids were determined by quantitative 
Western blotting using recombinant purified 
FXRid protein as a reference. After washing 
twice with PBS, enriched spermatids were 
lysed with lysis buffer containing 50 mM Tris- 
HCl, pH 7.4, 1% Triton X-100, 150 mM NaCl, 
5 mM EDTA, and protease inhibitor cocktail. 
Cell lysates from defined cell counts were sub- 
jected to immunoblotting with anti-FXR1 anti- 
body (Novus, catalog #NBP2-22246) using 
various amounts of purified FXR1d protein 
for plotting the calibration curve. The amount 
of FXR1 protein per cell was calculated by the 
calibration curve and cell counts. 


Lentivirus packaging and testis transduction 


Lentivirus packaging and testis transduction 
were performed as we recently described (57). 
Briefly, lentiviral vectors and cognate packag- 
ing plasmids were cotransfected in 293T cells 
and packaged into pseudoviruses following 
standard procedures. High-titer lentivirus 
(>108 transduction units/ml) was prepared 
through ultracentrifugation, and ~10 ul of high- 
titer lentivirus was injected into the seminifer- 
ous tubule through the efferent duct using a 
sharp glass capillary with a tip diameter of 
50 um. One week after transduction of lenti- 
viral vectors, the testes were harvested for 
examining the transduced spermatids, and 
epididymides were harvested 2 weeks after 
transduction for examining the sperm pro- 
gressed from transduced spermatids. 


Proteomics and protein profiling analysis 


To identify FXR1-binding proteins in mouse 
testes, testicular tissues were homogenized 
in lysis buffer containing 50 mM Tris-HCl, 
pH 7.4, 1% Triton X-100, 150 mM NaCl, 5 mM 
EDTA, and 1x Complete Mini Proteinase In- 
hibitor cocktail (Roche) and then centrifuged 
for 10 min at 10,000g and 4°C. Anti-FXR1 or 
anti-IgG coupled Protein A/G beads were added 
to the testicular extract and incubated for 
3 hours at 4°C. After washing the beads with 
washing buffer containing 50 mM Tris-HCl, 
PH 7.4, 0.1% Triton X-100, 500 mM NaCl, 5 mM 
EDTA, and proteinase inhibitor cocktail, IP pel- 
lets were subjected to liquid chromatography- 
tandem mass spectrometry (LC-MS/MS) analysis 
with standard protocols for proteomic analysis. 
To identify the proteins in polysome fractions 
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of mouse testes, polysome fractions prepared 
through sucrose gradients were subjected to 
LC-MS/MS analysis with standard protocols 
for proteomic analysis. The protein intensities 
of all identified proteins were normalized by 
total intensity and then used for differential 
protein expression analysis using DEqMS (53). 
The proteins in the differential expression 
analysis were required to have values greater 
than zero in two replicates for both IP and con- 
trol groups to calculate the fold change and 
false discovery rate (FDR) values. For proteins 
with only one value greater than zero in either 
group, the protein intensity equal to zero was 
replaced by the minimum value of this sample 
to calculate the fold change without FDR. 


RNA-seq and polysome profiling 


Mouse testicular tissues were homogenized 
and extracted with RNAiso Plus (Takara) ac- 
cording to the manufacturer’s protocol. For 
polysome profiling, testicular extracts were 
subjected to sucrose gradient analysis, and 
the obtained polysomes were subjected to 
RNA extraction with standard protocols. The 
extracted RNA samples were then subjected to 
RNA-seq using Illumina sequencing tech- 
nology. Clean reads were first mapped to 
ribosomal RNAs (rRNAs) and transfer RNAs 
(tRNAs), and those unmapped reads were then 
mapped to the mouse genome (mm10) with 
gene annotation GENECODE V23 using STAR 
(v2.5.3a) (54) under the default parameters ex- 
cept for the setting “-outFilterMatchNmin 40.” 
The FeatureCounts program from Rsubreads 
package (55) was used to count mapped reads 
for all annotated genes for all samples. The 
reads counts of all genes in different sample 
tables were used for differential gene expres- 
sion analysis with DESeq2 (56). The cutoff 
for differentially expressed genes was set as 
FDR < 0.05 and fold change >1.5. 

The polysome profiling sequencing data 
were preprocessed in the same manner as 
RNA-seq data, and the read counts of two 
groups were used for differential translational 
efficiency analysis using the R package Xtail 
(1.15) (57). The cutoff for call differential trans- 
lational efficiency genes was set as FDR <0.01 
and fold change >1.5. 


eCLIP-seq and sequencing data analysis 


eCLIP-seq was performed as previously de- 
scribed (21, 58). In brief, testicular lysates 
prepared from adult mouse testes were sub- 
jected to UV cross-linked treatment (254 nm, 
400 mJ/cm?) three times, and then a limited 
RNase I digestion was performed before FXR1 
immunoprecipitation. After dephosphorylation, 
a barcoded RNA adapter was ligated to the 
3' end of anti-FXR1 [Ped RNAs. Samples were 
then run on standard protein gels, transferred 
to nitrocellulose membranes, and RNA-protein 
bands ranging from 75 to 150 kDa were cut 
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for extracting RNAs. The obtained RNAs were 
then reverse transcribed, and a second DNA 
adapter was ligated to the cDNA fragment 3' 
end. After cleanup, an aliquot of each sample 
was first subjected to qPCR (to identify the 
proper number of PCR cycles) and then the 
remaining cDNA samples were PCR amplified 
and size selected by agarose gel electrophoresis. 
Samples were subjected to RNA-seq analysis 
using Illumina sequencing technology. Briefly, 
eCLIP-seq libraries with distinct inline barcodes 
were demultiplexed, and the adapters in reads 
were trimmed with cutadapt (v1.18). The reads 
218 nucleotides were discarded. Reads were 
first mapped to rRNAs and tRNAs, and those 
unmapped reads were then mapped to the 
mouse genome (mm10) with gene annotation 
GENECODE V23 using STAR (54). For each 
sample, uniquely mapped reads for annotated 
genes using the FeatureCounts program (55). 
The read distribution across the genome was 
calculated using the read distribution.py mod- 
ule in the RSeQC program (59). The FXR1 
targeting genes over input controls were 
identified using DESeq2 (56) with the cut- 
offs set as FDR < 0.05 and fold change >1.5. 


GO analysis 


GO term enrichment analysis was performed 
using the R package clusterProfiler (60) with a 
Benjamini-Hochberg multiple testing adjust- 
ment under an FDR cutoff of 0.05. Enriched 
GO terms of biological process were plotted for 
visualization. 


Statistical analysis and data visualization 


The statistical tests were performed using 
the ggpubr package in R. The figures from 
computational analysis were made using the 
ggplot2 (67) and ComplexHeatmap (62) pack- 
ages, and then the Venn diagrams were made 
using the eulerr package in R. 
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Broadly neutralizing antibodies target the 


coronavirus fusion peptide 
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The potential for future coronavirus outbreaks highlights the need to broadly target this group of pathogens. 
We used an epitope-agnostic approach to identify six monoclonal antibodies that bind to spike proteins 
from all seven human-infecting coronaviruses. All six antibodies target the conserved fusion peptide region 
adjacent to the S2’ cleavage site. COV44-62 and COV44-79 broadly neutralize alpha- and betacoronaviruses, 
including severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) Omicron subvariants BA.2 and 
BA.4/5, albeit with lower potency than receptor binding domain-specific antibodies. In crystal structures 
of COV44-62 and COV44-79 antigen-binding fragments with the SARS-CoV-2 fusion peptide, the fusion 
peptide epitope adopts a helical structure and includes the arginine residue at the S2' cleavage site. 
COV44-79 limited disease caused by SARS-CoV-2 in a Syrian hamster model. These findings highlight the 
fusion peptide as a candidate epitope for next-generation coronavirus vaccine development. 


oronaviruses consist of four genera that 

infect birds and mammals (7). Seven 

coronaviruses are known to cause human 

disease: the alphacoronaviruses HCoV- 

229E (human coronavirus 229E) and 
HCoV-NL63 (human coronavirus NL63), as 
well as the betacoronaviruses HCoV-OC43 
(human coronavirus OC43), HCoV-HKUI (hu- 
man coronavirus HKU1), SARS-CoV (severe 
acute respiratory syndrome coronavirus), 
MERS-CoV (Middle East respiratory syn- 
drome coronavirus), and SARS-CoV-2 (severe 
acute respiratory syndrome coronavirus 2). 
Whereas the first four coronaviruses gener- 
ally cause mild disease, the latter three have 
brought about serious outbreaks in recent 
years. In particular, the ongoing COVID-19 
pandemic caused by SARS-CoV-2 has resulted 
in more than 6 million deaths since the first 
cases were identified in 2019 (2). The cur- 
rently dominant SARS-CoV-2 Omicron BA.2, 
BA.2.12.1, BA.4, and BA.5 subvariants are at 


least partially resistant to most available vac- 
cines and antibody therapeutics (3-6). Further- 
more, two coronaviruses previously linked only 
to animal infection were recently detected in 
individuals with flu-like symptoms (7, 8). These 
developments highlight the importance of tar- 
geting conserved and functionally essential 
sites on coronaviruses. 

Coronavirus infection is a multistep process 
that involves enzymatic cleavage and rearrange- 
ment of the surface spike protein (9). The SARS- 
CoV-2 spike contains two cleavage sites: a furin 
cleavage site at the boundary of the S1 and S2 
subunits, and an S2’ site that is conserved in 
coronaviruses. The spike protein is thought to 
be cleaved at the S1-S2 site during virus as- 
sembly, leaving the S1 and S2 subunits non- 
covalently linked. During entry, the SARS-CoV-2 
spike protein uses the receptor binding domain 
(RBD) on the S1 subunit to engage angiotensin- 
converting enzyme 2 (ACE2) on target cells. 
After receptor binding, the S1 subunit is shed 


and the $2’ site is cleaved by the membrane 
enzyme transmembrane serine protease 2 
(TMPRSS2) or endosomal cathepsins (J), lead- 
ing to insertion of the fusion peptide into the 
cell membrane and viral fusion. 

Much of the protection provided by COVID-19 
vaccines arises from neutralizing antibodies that 
target the RBD (JO). Likewise, all currently 
available therapeutic monoclonal antibodies 
(mAbs) target this domain (3). However, spike 
elements that participate in the subsequent 
stages of infection involve the more complex 
$2 fusion machinery with many moving parts, 
and these elements are more conserved than 
the RBD, which so far has been capable of 
retaining or even increasing binding to ACE2 
despite a variety of mutations (77). Therefore, 
these sites are worth exploring as targets for 
novel COVID-19 vaccines and therapeutics 
that retain efficacy against new variants and 
protect against a wider range of coronaviruses. 
Progress in this direction has started with re- 
cent studies identifying several mAbs that tar- 
get the conserved stem helix (12-16) and using 
unbiased approaches to screen for mAbs of 
interest (77-20). In this study, we carried out 
a large-scale survey of the binding landscape 
of broadly reactive mAbs against coronaviruses. 


Identification of broadly reactive mAbs from 
COVID-19-convalescent donors 


To identify individuals who are likely to har- 
bor B cells that produce broadly reactive mAbs, 
we used a multiplex bead-based assay to ex- 
amine plasma samples of 142 donors from a 
previously described cohort of COVID-19- 
convalescent individuals (20). We assessed 
plasma immunoglobulin G (IgG) reactivity 
toward spike glycoproteins of the seven human 
coronaviruses: SARS-CoV-2 (Wuhan-Hu-1 strain), 
SARS-CoV, MERS-CoV, HCoV-HKU1, HCoV- 
OC43, HCoV-NL63, and HCoV-229E. Nineteen 
donors were selected for mAb isolation and char- 
acterization on the basis of plasma IgG reactivity 
to the spike proteins of SARS-CoV-2 and at least 
two other betacoronaviruses (fig. SIA). 
We next investigated human IgG* memory 
B cells (MBCs) from the selected donors by 
using a two-stage screen to prioritize isolating 
mAbs with the greatest possible breadth of 
reactivity. First, we screened supernatants 
from 673,671 stimulated IgG* B cells for binding 
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Fig. 1. Broadly neutralizing antibodies target coronaviruses associated with 
human disease. (A) Analysis of the frequency of MBCs expressing broadly 
reactive antibodies from 19 donors. Values in parentheses represent the 
percentage of SARS-CoV-2-reactive supernatants that also bind the specified 
subsets of nonsarbecovirus coronavirus spikes. A total of 10,356 MBC culture 
supernatants (50 to 100 B cells per well) were screened. n, number of MBC 
culture wells. (B) Phylogenetic relationships across the coronavirus spike 
proteins targeted by the broadly reactive mAbs were inferred by the neighbor- 
joining method in MEGAI1L using full-length amino acid sequences of CoV spike 
proteins. Branch lengths are drawn to scale, and bootstrap values from 500 
samplings are shown on the branches. The scale bar represents the number of 
amino acid substitutions per site. (©) Heatmap representing the binding of 
broadly reactive mAbs to spike proteins from coronaviruses across the alpha-, 
beta-, and deltacoronavirus genera. Hl hemagglutinin was included as a negative 
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control for mAb binding experiments, and area under the curve (AUC) values for 
each antigen are shown after subtraction with values for the negative control 
antigen CD4. (D) Values represent antibody titer at 50% neutralization (NT50) 
against SARS-CoV-2 (Wuhan-Hu-1 strain), SARS-CoV (indicated as “CoV-1" 

in the figure), MERS-CoV, HCoV-NL63, and HCoV-229E envelope-pseudotyped 
entivirus, as well as authentic HCoV-OC43. For Assayscripps, Values are average 
of two experiments. For values with the + symbol, one NTs9 was determinable 
and one was not (i.e., >100 ug/ml), and the determinable NTso is shown. 
egative control mAbs were anti-CoV-2 RBD CV503 (for OC43 assay) (20), 
anti-influenza HA CR9114 (for Assayniy except OC43) (42), and anti-dengue DEN3 
(for Assayscripps) (43). NTso values were calculated using the dose-response- 
inhibition model with five-parameter Hill slope equation in GraphPad Prism 9. 
(E) Neutralization of SARS-CoV-2 variants of concern (pseudovirus, Assayscripps) 
by COV44-62 and COV44-79. 


to the coronavirus spike panel used in the 
plasma screen. Supernatants from only 2% 
(n = 211) of the MBC culture wells met our 
criteria for broad reactivity by binding at 
least three betacoronavirus spike proteins 
(Fig. 1A). Next, we developed an optofluidics 
assay to isolate individual MBCs of interest 
using the Berkeley Lights Beacon system 
(fig. S1B). Candidate MBCs identified in the 
supernatant screen were sorted individually 
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into nanoliter-volume pens and assessed in 
real time for secretion of mAbs that bound to 
beads coated with a cocktail of MERS-CoV and 
HCoV-O0C43 spikes, followed by beads coated 
with SARS-CoV-2 spike. Double-positive MBCs 
were exported for single-cell reverse transcrip- 
tion polymerase chain reaction and antibody 
expression as recombinant IgG1. In total, we 
obtained 60 IgG mAbs with reactivity to at 
least three coronaviruses. 


To fully examine their breadth, we tested 
the 60 mAbs for binding to spikes from the 
seven human coronaviruses. Only six mAbs— 
COV91-27, COV44-62, COV44-79, COV77-04, 
COV77-39, and COV78-36—bound to spike 
proteins from all seven coronaviruses (Fig. 1, 
B and C). Notably, four of the six also bound 
to spike from two new coronaviruses that have 
recently been associated with human disease: 
canine CoV HuPn-2018 (CCoV-HuPn-2018) and 
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porcine deltacoronavirus 0081-4 (PDCoV-0081- 
4) (7, 8) (Fig. 1C). The six broadly reactive mAbs 
were isolated from four different donors and 
were encoded by four different heavy-chain 
variable (VH) genes (VH1-2, VH3-30, VH4-59, 
and VH5-10-1) (table S1). Five mAbs were highly 
mutated, with VH nucleotide mutation frequen- 
cies ranging from 10 to 13% (fig. SIC). Given that 
these mAbs were isolated from COVID-19- 
convalescent individuals in New York ~1 month 
after the first outbreak in March 2020, these 
mutation levels suggest that the B cells were 
primed during an earlier seasonal coronavirus 
infection and possibly reactivated during SARS- 
CoV-2 infection. 


COV44-62 and COV44-79 broadly 

neutralize coronaviruses 

We assessed the neutralizing potency of the six 
mAbs against SARS-CoV-2, SARS-CoV, MERS- 
CoV, HCoV-NL63, and HCoV-229E envelope 
pseudotyped viruses, as well as authentic 
HCoV-0C43. COV44-62 and COV44-79 showed 
the broadest functional reactivity, neutralizing 
the betacoronaviruses SARS-CoV-2, SARS-CoV, 
and HCoV-OC48, as well as the alphacorona- 
viruses HCoV-NL63 and HCoV-229E (Fig. 1D 
and fig. S1D). Moreover, both mAbs neu- 
tralized SARS-CoV-2 variants of concern, 
including the Omicron BA.2 and BA.4/5 sub- 
variants, as well as authentic SARS-CoV-2 


(Fig. 1E and fig. SIE). COV44-62 also neutral- 
ized MERS-CoV, whereas no other mAbs neu- 
tralized this virus within the concentrations 
tested. 


Broadly reactive mAbs target the coronavirus 
fusion peptide 

To determine the domain of SARS-CoV-2 spike 
that was targeted by the six broadly reactive 
mAbs, we assessed mAb binding to the SARS- 
CoV-2 S2 subunit as well as the RBD and N- 
terminal domain (NTD) of the S1 subunit. All 
six mAbs bound only to the S2 subunit (Fig. 2A). 
VJ germline-reverted versions of the broadly 
neutralizing mAbs COV44-62 and COV44-79 
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Fig. 2. Broadly reactive mAbs target the same region within the SARS-CoV-2 
$2 subunit. (A) Titration curves for mAb binding to selected regions within the 
SARS-CoV-2 spike protein: the RBD, NTD, and S2 subunit. Interconnected data 
points are shown without curve fitting. L9 is a malaria-specific mAb used as a 
negative control (44). MFI, median fluorescence intensity. (B) On rates, off rates, and 
dissociation constants of the six fusion peptide Fabs for binding to SARS-CoV-2 
prefusion-stabilized spike (2P) with an unmodified furin cleavage site and the 
nonstabilized S2 subunit. k3, association rate constant: kg, dissociation rate constant; 
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showed weaker binding to S2 than the mutated 
versions (fig. S2A), suggesting that somatic mu- 
tations were important for improving binding 
to the target site. A surface plasmon resonance 
(SPR) kinetics assay determined the binding 
affinity of antigen-binding fragments (Fabs) 
derived from these mAbs for prefusion-stabilized 
whole SARS-CoV-2 spike (2P, with an intact 
S1-S2 cleavage site) and the unmodified $2 
subunit. The Fabs bound with low-to-moderate 
nanomolar affinity to both proteins, but their 
affinity for the S2 subunit was 3- to 76-fold 
higher than their affinity for whole spike (Fig. 2B 
and fig. S2B). There were no substantial dif- 
ferences between the six Fabs in their affinity 
for the S2 subunit. The mAbs showed reduced 
binding to a form of S2 that had been sta- 
bilized with two proline mutations (S2-2P) and 
bound more poorly still to a further stabilized 
version with six proline mutations (S2-HexaPro) 
(Fig. 2C). SPR-based competition showed that 
the six mAbs competed for the same binding 
site on the S2 subunit (Fig. 2D), but none com- 
peted with S2P6, a control mAb targeting the 
stem helix region (12), indicating their specific- 
ity for a distinct site on S2. 

To further investigate the specificity of these 
mAbs, we performed SPR-based peptide 
mapping using an array of 15-amino acid 
(15-mer) overlapping peptides that spanned 
the entire SARS-CoV-2 S2 subunit (S686 to 
K1211, accession no. YP_009724390.1). All 
six mAbs bound to peptides 42 to 44, which 
share the g;;RSFIEDLLF 93 motif (Fig. 3A). 
This motif is located within the SARS-CoV-2 
fusion peptide region, directly C-terminal 
to the S2’ cleavage site. To determine the 
diversity of this region across corona- 
viruses, we selected 34 viral isolates repre- 
senting each of the four coronavirus genera 
(Fig. 3B). Nearly all amino acid positions in 
the g;;RSFIEDLLF,23 motif were conserved 
in >85% of viruses selected; the exception 
was Fg 7, which was conserved in <50% of 
isolates examined (Fig. 3, B and C). The 
fusion peptide appears partially surface- 
exposed in a range of coronavirus spike 
proteins, including those of SARS-CoV-2, 
SARS-CoV, MERS-CoV, and MHV (mouse 
hepatitis virus) (2/, 22) (Fig. 3C and fig. S3A). 
However, antibody access to this site may be 
partially occluded by the S1 subunit on an 
adjacent protomer, consistent with stronger 
binding to the S2 subunit relative to the 
SARS-CoV-2 spike (Fig. 2B and fig. S3B). 

The mAb specificity toward the fusion pep- 
tide is consistent with their reduced binding 
to the HexaPro S2 construct (Fig. 2C), which 
includes a nonconservative F817P mutation at 
this site (23). To identify key amino acids for 
mAb binding, we performed an alanine scan 
on a peptide encompassing residues 810 to 
830 and focused on residues targeted by the 
broadly neutralizing mAbs COV44-62 and 
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COV44-79 (Fig. 3D). Four amino acids—E819, 
D820, L822, and F823—were important for 
binding of COV44-62, where mutation of the 
F823 residue abolished binding. Similarly, 
residues critical for the binding of COV44-79 
were E819, D820, and F823 but also included 
R815 at the S2’ cleavage site (Fig. 3D). All five 
residues identified as important for COV44- 
62 or COV44-79 binding are among the most 
conserved residues in the coronavirus spike 
protein. D820 and L822 are completely con- 
served, whereas R815, E819, and F823 are 
conserved in 33 of 34 coronaviruses (Fig. 3, 
B and E). Amino acid mutations at the pep- 
tide level may have different effects from 
mutations in the intact spike protein, where 
modified interactions with surrounding resi- 
dues may also affect antibody binding. There- 
fore, we screened the six mAbs using a shotgun 
alanine mutagenesis approach, whereby every 
amino acid in the S2 subunit of intact spike 
was individually mutated to generate a panel 
of spike mutants (Fig. 3F and fig. $3, C and D). 
In general, this assay identified a greater num- 
ber of residues as important for mAb binding, 
including some with a more intermediate 
phenotype. For COV44-62, D820, L822, and 
F823 were again crucial for binding, and K825, 
D830, and R815 were also identified as im- 
portant (Fig. 3F). For COV44-79, the results 
closely matched the peptide alanine scan, with 
the same four amino acids (R815, E819, D820, 
and F823) identified as the most critical. When 
the six mAbs were analyzed as a group, we 
found that only the four broadest neutraliz- 
ing mAbs were negatively affected by the 
R815A mutation (fig. S4, A and B), suggesting 
that binding to the S2’ cleavage site may be a 
distinguishing property of broadly neutralizing 
mAbs against this site. 


Crystal structures of anti—fusion 
peptide antibodies 


To elucidate the molecular characteristics of 
anti-fusion peptide antibodies that neutralize 
SARS-CoV-2, the Fabs of the three broadest 
neutralizing mAbs (COV44-62, COV44-79, and 
COV91-27) were complexed with 15-mer pep- 
tides containing the fusion peptide sequence 
(Fig. 4). Crystal structures were determined to 
146-, 2.8-, and 2.3-A resolution, respectively 
(Fig. 4, fig. S5, and table S2). Fourteen of the 15 
peptide residues were visible in the electron 
density map for COV44-62 (fig. S5A), 13 of 
which have a buried surface area (BSA) >0 A? 
in complex with antibody. For COV44-79, 12 of 
the 15 peptide residues were visible (fig. S5A), 
10 of which have a BSA >0 A”. Similarly, in 
COV91-27, 12 peptide residues had interpretable 
density (Fig. 4C and fig. S5A), with nine ex- 
hibiting a BSA >0 A®. The fusion peptide forms 
a helix as in the prefusion state of the SARS- 
CoV-2 spike (fig. S5B). All three complementarity- 
determining regions (CDRs) of the heavy chain 


(HC) of all three Fabs are involved in peptide 
recognition, whereas CDR1 and CDR3 of the 
light chain (LC) of COV44-62, and only LCDR3 
of COV44-79 and COV91-27, contact the peptide 
(Fig. 4, A to C). The BSA on each Fab is 
dominated by the HC and is 791 A” for COV44- 
62 (627 A” by HC and 164 A” by LC), 634 A” for 
COV44-79 (505 A” by HC and 129 A” by LC), and 
573 A? for COV91-27 (447 A? by HC and 126 A” 
by LC). The fusion peptide makes side-chain 
and backbone H-bonds and salt bridges with 
COV44-62 mainly through K814, R815, E819, 
D820, L822, F823, and N824, and hydropho- 
bic interactions through 1818, L822, and F823 
(Fig. 4D). These residues include the key resi- 
dues (E819, D820, L822, and F823) identified 
by site-directed mutagenesis (Fig. 3D). The fu- 
sion peptide did not form as many interactions 
with COV44-79 and COV91-27 (Fig. 4, E and F). 
However, R815, E819, and D820 contributed 
H-bonds and salt bridges, and 1818, L822, and 
F823 made hydrophobic interactions. In all 
three antibodies, R815, S816, 1818, E819, D820, 
L822, and F823 contributed the most BSA to 
the interaction (Fig. 4, G to I). There was par- 
tial overlap between the residues of COV44-62 
and COV44-79 that interacted with the fusion 
peptide and those that were mutated from the 
germline (fig. S6), consistent with the reduc- 
tion in binding of germline-reverted versions 
of these mAbs (fig. S2A). The structural results 
are consistent with the peptide and spike pro- 
tein mutagenesis data that identify the key 
binding residues (Fig. 3, D to F, and fig. S3D). 
Notably, the arginine at the S2’ cleavage site is 
involved in recognition by these anti-fusion 
peptide antibodies. Although the antibodies 
all interact with one face of the fusion peptide 
helical structure (fig. S5D), their approach 
angles to the peptide differ. Superimposition 
of the fusion peptide structures onto an intact 
SARS-CoV-2 spike trimer structure in the 
prefusion state showed a potential clash with 
the S protein, suggesting that a conformational 
change or conformational dynamics around 
the fusion peptide is required to accommo- 
date antibody targeting (fig. S5C). Nevertheless, 
these antibodies have neutralization activity 
against SARS-CoV-2 and therefore are able to 
interact with the fusion peptide on the virus 
(Fig. 1D). 


Response to the fusion peptide after 
vaccination and infection 


We compared the binding of polyclonal IgG 
from mRNA-1273-vaccinated donors (fig. S7A), 
COVID-19-convalescent individuals, and COVID- 
19-naive individuals to the SARS-CoV-2 fusion 
peptide (peptide 43) (Fig. 3A and fig. S7B). All 
COVID-19-naive donors showed minimal bind- 
ing to the peptide, indicating a minor contribu- 
tion by previous seasonal coronavirus infections 
to circulating fusion peptide-specific IgG. Al- 
though there was an increased response in 
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Fig. 3. Broadly neutralizing antibodies target the conserved fusion pep- 
tide. (A) Heatmap of SARS-CoV-2 S2 peptide array. Binding responses were 
assessed by SPR using a 15-mer peptide array with 12-amino acid overlay 
covering the entire S2 subunit. Each column within the map represents a 
single peptide. The white triangle denotes the S1-S2 cleavage site, and the 
black triangle indicates the S2' cleavage site. FP, fusion peptide; HR1, heptad 
epeat 1; C Helix, central helix; CD, connector domain; SH, stem helix; HR2, 
heptad repeat 2. (B) Sequence alignment of the fusion peptide from 34 viral 
isolates representing a diverse group of coronaviruses across four genera. 
This analysis was performed using MAFFT v7 with a BLOSUME62 scoring 
matrix and the L-INS-I algorithm. Single-letter abbreviations for amino 
acids: A, Ala; C, Cys; D, Asp; E, Glu; F, Phe; G, Gly; H, His; |, lle; K, Lys; L, Leu; 
, Met; N, Asn; P, Pro; Q, Gln; R, Arg; S, Ser; T, Thr; V, Val; W, Trp; and 

Y, Tyr. (©) Sequence conservation of prefusion SARS-CoV-2 spike protein (PDB 
ID 6VSB); the fusion peptide (amino acids 816 to 843) is outlined in black. The 
inset shows a magnified view of this region. (D) Alanine scan evaluating the 
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binding of COV44-62 and COV44-79 to the SARS-CoV-2 fusion peptide. 
Responses were normalized to the wild-type sequence. A cutoff of 20% (brown 
dashed line) was used to identify residues that were critical for binding. 

(E) Sequence logo plot of diversity within the fusion peptide region of 
coronaviruses from 34 isolates, built using WebLogo 3. Letter height is 
proportional to the probability of an amino acid at a given position, and amino 
acid residues are colored by charge. Narrow stacks (amino acids) indicate 
deletions or gaps in the sequences. Numbering is based on the SARS-CoV-2 
Wuhan-Hu-1 sequence. The key residues in the epitope footprints of mAbs 
COV44-62 (red) and COV44-79 (blue), based on peptide alanine scanning, are 
highlighted above the logo plot. (F) Amino acids critical for the binding of 
COV44-62 and COV44-79, identified by shotgun alanine mutagenesis of S2 
residues on whole spike protein. Only fusion peptide residues are shown. Key 
residues were identified on the basis of a <20% signal relative to wild-type spike 
(brown dashed line), with no corresponding loss of signal for a control mAb, 
which targets the spike protein but does not bind to this site (fig. S3C). 
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Fig. 4. Crystal structures of COV44-62, COV44-79, and COV91-27 in complex with SARS-CoV-2 
fusion peptide. (A to C) Overall interactions of (A) COV44-62, (B) COV44-79, and (C) COV91-27 with the 
fusion peptide. Fabs are shown in molecular surface representation, and the CDRs and peptides are 
represented as tubes. Cyan and yellow denote the heavy and light chains of the Fabs, respectively. Peptides 
are shown in green. H1, H2, H3, Ll, and L3 denote CDRs in the heavy (H) and light (L) chains. The 
resolutions of the crystal structures are 1.46, 2.8, and 2.3 A for the COV44-62, COV44-79, and COV91-27 
complexes, respectively. Peptide residues observed in the crystal structures are in bold; residues that interact 
with antibodies (BSA >0 A’) are in red. (D to F) Details of the interactions between (D) COV44-62, 

(E) COV44-79, and (F) COV91-27 with the fusion peptide. Vy and V, indicate the variable domains of the 
heavy and light chains, respectively. Kabat numbering was used for the Fabs; numbering in the native 
spike protein was used for the fusion peptide. Colors for the heavy chain, light chain, and fusion peptide are 
as in (A). (G to 1) BSA (gray) and accessible surface area (white) of each residue of the fusion peptide in 
complex with antibody are shown in the stacked bar chart. Residues that form polar interactions with 
COV44-62, COV44-79, and COV91-27 are denoted atop the corresponding bar with “H” for a hydrogen bond 
or “S” for a salt bridge. Buried and accessible surface areas were calculated with PISA (45). 


several vaccine recipients after the second dose 
(P = 0.025), this was not enhanced by ad- 
ministration of a booster dose. The COVID-19- 
convalescent donors did not have significantly 
higher responses than vaccinated donors after 
the second dose (P = 0.864). However, several 
convalescent donors had the highest re- 
sponses in all three cohorts, suggesting that 
SARS-CoV-2 infection triggers a strong fusion 
peptide-specific antibody response in some 
individuals. 
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COV44-62 and COV44-79 inhibit 

membrane fusion 

Spike-mediated cell fusion relies on insertion 
of the fusion peptide into the target cell mem- 
brane and thus might be inhibited by antibody 
binding to the fusion peptide. Consistent with 
this, COV44-62 and COV44-79 inhibited the 
fusion of cells expressing SARS-CoV-2 spike 
and cells expressing the ACE2 receptor in an 
imaging-based assay (Fig. 5A). We further 
tested the six fusion peptide-specific mAbs 


in a more quantitative assay wherein fusion 
would trigger the release of an enzyme that 
cleaves a chromogenic substrate. Only the 
mAbs that neutralized SARS-CoV-2 were able 
to strongly inhibit fusion (Fig. 5B). 


COV44-79 limits disease in the Syrian 
hamster model 


We evaluated the in vivo efficacy of COV44-62 
and COV44-79 against SARS-CoV-2 infection in 
the Syrian hamster model, a well-established 
model that recapitulates features of moderate- 
to-severe COVID-19 in humans (24-26). We 
converted the Fc regions of the two mAbs to 
hamster IgG2 to allow optimal Fc function. 
The mAbs were administered intraperitoni- 
ally at 16 mg per kilogram of body weight 
(mg/kg), followed by intranasal administra- 
tion of 5 log;> plaque-forming units of SARS- 
CoV-2 WAI 24 hours later (Fig. 5, C and D, 
and fig. S8). Hamsters treated with COV44-79 
and, to a lesser extent, COV44-62 had a smaller 
decrease in body weight and recovered more 
quickly than untreated hamsters (P < 0.01 from 
days 3 to 7 for COV44-79; P < 0.05 from days 5 
to 7 for COV44-62) (Fig. 5C). Similar results 
were observed in a second experiment in which 
COV44-79 was tested in comparison with a 
hamster IgG2 isotype control (fig. S8B). Fur- 
thermore, semiquantitative scoring revealed 
that hamsters treated with COV44-79 had less 
interstitial pneumonia than untreated ham- 
sters on day 7 (P < 0.05) (Fig. 5D). COV44-79 
was also able to slightly reduce lung viral 
titers relative to control hamsters, on the 
basis of subgenomic RNA quantification and 
plaque assay analysis (fig. S8C). 


Discussion 


The broad conservation and functional im- 
portance of the fusion peptide highlight the 
potential of this site as a candidate for corona- 
virus vaccine development. These findings 
have parallels to studies of HIV-1 gp120, where 
the surface-exposed fusion peptide was iden- 
tified as a target of neutralizing mAbs (27). 
This discovery led to the investigation of the 
HIV-1 fusion peptide as a candidate immuno- 
gen to elicit broadly neutralizing antibodies in 
animals (28); subsequent animal studies have 
increased the potency and breadth of the 
fusion peptide-targeting antibody response 
(29). Despite these potential advantages, the 
coronavirus fusion peptide has not been a 
major focus for development of therapeutic 
mAbs and COVID-19 vaccines. The main draw- 
back of the mAbs described here is their com- 
paratively low in vitro neutralization potency. 
These mAbs fit into a wider trend of a trade-off 
between potency and breadth: Highly potent 
mAbs that target the RBD are restricted to 
sarbecoviruses and most do not neutralize all 
variants of SARS-CoV-2 (3, 5, 6, 20, 30), whereas 
mAbs that target the stem helix (12-16) and 
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Fig. 5. COV44-62 and COV44-79 inhibit SARS-CoV-2 spike—mediated 
fusion and COV44-79 limits disease in a Syrian hamster model. (A) Images 
of fusion between HeLa cells stably expressing SARS-CoV-2 spike (red 
fluorescent protein) and HeLa cells stably expressing the ACE2 receptor (green 
fluorescent protein) after counterstaining with Hoechst (blue). Cells were 
cocultured in the presence of COV44-62 or COV44-79, or without a mAb 
(control). Scale bars, 500 um. (B) Fusion inhibition of six fusion peptide—specific 
mAbs in a quantitative assay. (C) Weight change for SARS-CoV-2-naive 
animals versus virus-exposed animals that were mock-treated or treated with 
16 mg/kg of mAb. Statistical significance for average body weight was analyzed 


across the 7-day time course using a mixed-effects repeated measures model 
with Dunnett's post-test multiple comparison (n = 12 animals from days 0 to 3; 
n = 6 animals from days 4 to 7). Error bars show mean + SD. (D) Pathology 
scores for SARS-CoV-2-naive animals versus virus-exposed animals that were 
mock-treated or treated with 16 mg/kg of mAb. Scores for interstitial pneumonia 
pathology (days 3 and 7) based on gross pathology observations were 
statistically analyzed by a Kruskal-Wallis test with Dunn's post-test multiple 
comparison (n = 6 to 12 animals per condition) between the mAb-treated and 
mock-treated groups on each day. Bars show median and interquartile range. 
*P < 0.05; **P < 0.01; ***P < 0.001; ****P < 0.0001; ns, not significant. 


those identified here have greater breadth 
but are less potent. However, as previously 
reported for at least one antistem helix mAb 
(15), COV44-79 performed better than expected 
in the hamster model, which suggests that it 
may function in a way that is not captured 
effectively in the neutralization assay. For 
instance, Fe effector functions may enhance 
activity, as observed with anti-SARS-CoV-2 
antibodies with other specificities (31-33). 


734 12 AUGUST 2022 * VOL 377 ISSUE 6607 


Moreover, there is substantial scope for im- 
provement for mAbs with this specificity, and 
subsequent studies may discover mAbs with 
higher potency, as with the HIV-1 fusion peptide 
(34). Techniques to improve antibody affinity 
and potency could also be useful (35, 36). 
Additionally, vaccination with fusion peptide 
constructs may trigger a polyclonal response of 
greater magnitude and potency. We found that 
three doses of the mRNA-1273 vaccine did not 


produce strong antibody responses to the 
fusion peptide, although several COVID-19- 
convalescent individuals exhibited strong anti- 
body responses to this site. This observation 
is consistent with greater exposure of the S2 
subunit to B cells during natural infection as 
a result of S1 uncoupling, which likely occurs 
less frequently with prefusion-stabilized spike 
protein. Furthermore, depletion of fusion 
peptide-specific antibodies from the serum 


science.org SCIENCE 


RESEARCH | RESEARCH ARTICLES 


of COVID-19-convalescent patients resulted 
in a 20% reduction in SARS-CoV-2 neutral- 
ization (37), and reactivity to the fusion peptide 
was correlated with neutralization titer (38), 
indicating that polyclonal antibodies to the 
fusion peptide can play an appreciable role. Thus, 
our findings are consistent with previous studies 
that have highlighted the potential utility of 
the fusion peptide as a target epitope (37-41), 
and the fusion peptide-targeted mAbs provide 
additional tools to combat COVID-19 and enhance 
pandemic preparedness. 
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ACE2-binding exposes the SARS-CoV-2 fusion peptide 
to broadly neutralizing coronavirus antibodies 
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The coronavirus spike glycoprotein attaches to host receptors and mediates viral fusion. Using a broad 
screening approach, we isolated seven monoclonal antibodies (mAbs) that bind to all human-infecting 
coronavirus spike proteins from severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) immune 
donors. These mAbs recognize the fusion peptide and acquire affinity and breadth through somatic 
mutations. Despite targeting a conserved motif, only some mAbs show broad neutralizing activity in vitro 
against alpha- and betacoronaviruses, including animal coronaviruses WIV-1 and PDF-2180. Two 
selected mAbs also neutralize Omicron BA.1 and BA.2 authentic viruses and reduce viral burden and 
pathology in vivo. Structural and functional analyses showed that the fusion peptide-specific mAbs 
bound with different modalities to a cryptic epitope hidden in prefusion stabilized spike, which became 
exposed upon binding of angiotensin-converting enzyme 2 (ACE2) or ACE2-mimicking mAbs. 


he Orthocoronavirinae subfamily of 
coronaviruses comprises four genera: 
Alphacoronavirus, Betacoronavirus, 
Gammacoronavirus, and Deltacoronavirus. 
Alphacoronaviruses NL63 and 229E and 
betacoronaviruses OC43, HKU1, severe acute 


respiratory syndrome coronavirus 2 (SARS- 
CoV-2), SARS-CoV, and Middle East respi- 
ratory syndrome (MERS)-CoV account for 
almost all human coronavirus (hCoV) infec- 
tions, with sporadic infections attributed to 
canine alphacoronavirus CCoV-HuPn-2018 
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and porcine deltacoronavirus (PDCoV) (J-3). 
The spike (S) glycoprotein, which binds to 
various host receptors for viral entry (table 
SI), is composed of the S, and S, subunits, is 
highly divergent, with only ~30% sequence 
identity between alpha- and betacoronavi- 
ruses, and is the main target of neutralizing 
antibodies (4-7). Previous studies have de- 
scribed neutralizing monoclonal antibodies 
(mAbs) that cross-react among sarbecovi- 
ruses by targeting the receptor-binding do- 
main (RBD) in the S, subunit (8-14) or more 
broadly across betacoronaviruses by targeting 
the stem helix in the S, subunit (75-20). How- 
ever, mAbs neutralizing both alpha- and beta- 
coronaviruses have not been reported. Broadly 
neutralizing mAbs, as exemplified by those 
targeting influenza viruses or HIV-1 (21-26), 
can potentially be used for prophylaxis or ther- 
apy and to guide the design of vaccines elicit- 
ing broadly protective immunity (27, 28). 


Isolation of broadly reactive coronavirus mAbs 
from convalescent and vaccinated individuals 


To search for antibodies that cross-react with 
human-infecting alpha- and betacoronaviruses, 
we stimulated, under limiting conditions, total 
peripheral blood mononuclear cells (PBMCs) 
from SARS-CoV-2 immune donors in the pres- 
ence of the TLR7/8 agonist R848 and interleukin-2 
(IL-2), which selectively induce the prolifera- 
tion and differentiation of memory B cells 
(29). On day 12, the specificities of immuno- 
globulin G (IgG) secreted in the culture 
supernatants were tested by enzyme-linked 
immunosorbent assay (ELISA) against a 
panel of recombinant S proteins from alpha 
and beta hCoVs (Fig. 1, A to C). The number of 
SARS-CoV-2 S IgG-positive cultures was gener- 
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ally higher in COVID-19 convalescent patients 
and in SARS-CoV-2 vaccinees with prior in- 
fection (preimmune) compared with vaccinees 
without prior infection (naive). Most SARS- 
CoV-2 S IgG-positive cultures were either mono- 
specific or cross-reactive with the closely related 
SARS-CoV S, whereas a small fraction was also 
reactive with OC43 and HKU1S (Fig. 1, A to D), 
consistent with previous serological analyses 
(17, 30, 31). Six cultures (of >4000) from five 
individuals (of 43) cross-reacted with all alpha 
and beta hCoV S proteins tested (Fig. 1, A to 
D), suggesting that memory B cells producing 
broadly reactive coronavirus antibodies might 
exist at very low frequency. 

To isolate broadly reactive coronavirus mAbs, 
we combined the screening of polyclonally 
activated memory B cells with the sorting and 
cloning of antibody-secreting cells to retrieve 
paired heavy- and light-chain sequences (fig. S1). 
Using this approach, we isolated 16 SARS-CoV-2 
S-specific mAbs that cross-reacted with various 
hCoV S proteins (Fig. 1E and fig. S2A). Six mAbs 
(group 1) cross-reacted with the betacoronavi- 
ruses SARS-CoV, SARS-CoV-2, OC43, and HKU1 
S proteins, with median effective concentration 
(ECs) values ranging from 45 to 3000 ng/ml. 
Three mAbs (group 2) cross-reacted with high 
affinity to all beta hCoV S proteins, with ECs, 
values ranging from 18 to 40 ng/ml (Fig. 1E). 
These mAbs were found to target the stem helix 
region and have been described in a separate 
study (15). The remaining seven mAbs (group 3) 
exhibited the broadest cross-reactivity to both 
alpha and beta hCoV S proteins, with ECs9 
values ranging from 29 to 800 ng/ml (Fig. 1E). 
These broadly reactive mAbs, which are the 
focus of the present study, were isolated from 
convalescent or vaccinated individuals, used 
different V genes (except for C13B8 and C13A7, 
which are clonally related), and displayed a high 
load of somatic mutations (7 to 14% in VH, 2 to 
8% in VL at the nucleotide level; table S2). These 
results illustrate the utility of a simple high- 
throughput method based on multiple parallel 
screening steps of memory B cells to isolate 
broadly reactive coronavirus mAbs. 


Broadly reactive mAbs bind to the fusion 
peptide and acquire affinity and breadth 
through somatic mutations 


Using SARS-CoV-2 S;, RBD, and S, protein do- 
mains, as well as 15-mer linear peptides cov- 
ering the entire S sequence, the specificities of 
all seven group 3 mAbs were mapped to the 
K811PSKRSFIEDLLFNK825 sequence in the 
SARS-CoV-2 S, subunit (Fig. 2A). This sequence 
spans the S,’ cleavage site (R815) and the fusion 
peptide N-terminal region, which is essential 
for membrane fusion (32) and is highly conserved 
among all genera of the Orthocoronavirinae 
subfamily, including deltacoronavirus PDCoV 
and gammacoronavirus IBV, as well as all 
SARS-CoV-2 variants sequenced to date (fig. 


S3, A and B). Most of the seven group 3 mAbs 
bound more tightly to the prefusion SARS- 
CoV-2 S trimer without stabilizing proline 
substitution in the fusion peptide (PentaPro) 
relative to HexaPro (F817P) (33), and all of 
them bound with greater avidity to pre- 
fusion SARS-CoV-2 S, compared with post- 
fusion S, (Fig. 2A and fig. S2D). 

To explore the ontogeny of the isolated fu- 
sion peptide-specific mAbs, we compared the 
binding properties of mature mAbs to their un- 
mutated common ancestors (UCAs) (Fig. 2B 
and fig. S2B). The UCAs of C13B8, C13A7 (the 
two clonally related mAbs), and VNO1HI1 ex- 
hibited broad reactivity with hCoV S proteins 
but had lower affinity than their mature 
counterparts. By contrast, the C13C9 UCA 
bound only to the beta hCoVs OC43 and MERS- 
CoV S, whereas the VP12E7 UCA bound only to 
the alpha hCoV NL63 S, with low affinity in 
both cases. Finally, the UCAs of C28F8 and 
C77G12 did not bind to any hCoV S proteins 
tested. Given the lack of a common V gene usage, 
these findings suggest that broadly reactive fu- 
sion peptide-specific mAbs can mature through 
multiple pathways and acquire high affinity and 
cross-reactivity through somatic mutations, pos- 
sibly because of priming by endemic coronavirus 
infection followed by SARS-CoV-2 boosting upon 
infection or vaccination. 

Considering the high conservation of the fu- 
sion peptide region, we next assessed whether 
broad reactivity is a property shared by most 
fusion peptide-specific mAbs. IgGs secreted 
upon polyclonal activation of memory B cells 
from 71 convalescent individuals were screened 
for binding to a pool of alpha and beta hCoV 
fusion peptides, as well as to SARS-CoV-2 S protein. 
Cultures producing fusion peptide-specific mAbs 
were detected at low frequency and only in 19 
individuals (fig. S4A). Although nearly all fu- 
sion peptide-reactive mAbs bound to SARS- 
CoV-2 S, only nine of 30 were broadly reactive, 
and the remaining mAbs showed different de- 
grees of cross-reactivity (fig. S4B). Thus, although 
the coronavirus S fusion peptide is conserved 
(34-39), broad reactivity is the property of a 
minority of fusion peptide-specific mAbs, con- 
sistent with the observation that only three of 
the seven group 3 mAbs cross-reacted with the 
PDCoV and IBV fusion peptides (fig. S3C). 


Fusion peptide-specific mAbs have 
heterogeneous neutralizing activities and can 
reduce viral burden in vivo 


We next tested the neutralizing activity of 
all seven group 3 mAbs against pseudo- 
typed viruses carrying S proteins of alpha- and 
betacoronaviruses. Despite binding to the 
same motif, these mAbs exhibited dis- 
tinct neutralizing potencies. Most notably, 
VNO1H1 and VP12E7 neutralized all hCoV 
S-pseudotyped viruses tested (SARS-CoV-2, 
SARS-CoV, MERS-CoV, NL63, and 229E), as 
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Fig. 1. Rare broadly reactive A 
memory B cells are elicited 

upon natural infection or vac- 
cination. (A to D) Total PBMCs 
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(VP) (B), and vaccinees without 
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culture well (represented in 
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(in subcolumn) from each donor 
(in column). The order of the 
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6 mAbs that cross-reacted with 


well as bat sarbecovirus WIV-1 and merbe- 
covirus PDF-2180 (40) S-pseudotyped viruses 
(Fig. 2C and fig. S5, A to C). C77G12 neu- 
tralized all betacoronavirus S pseudotype 
viruses tested with the highest relative po- 
tency for SARS-CoV-2 (Fig. 2C and fig. S5, A 
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to C). The SARS-CoV-2-neutralizing activity 
of VNO1H1 and C77G12 was reduced by the 
addition of soluble fusion peptide (fig. S5D). 
Furthermore, VNO1H1, C77G12, and C28F8 
also inhibited SARS-CoV-2 S-mediated cell- 
cell fusion (Fig. 2D), suggesting that fusion 


multiple hCoV S proteins were isolated from 10 donors. Shown are ECs9 values to the respective commercially available hCoV S proteins measured by ELISA. mAbs are 
grouped on the basis of reactivity patterns. Shown is one representative experiment out of at least two performed. Group 2 mAbs C21G12, C21D10, and C21E3 were described 
in a separate study (15) (designated P34G12, P34D10, and P34E3, respectively). 


peptide-specific mAbs could prevent S proteolytic 
activation or fusogenic rearrangements, thereby 
inhibiting membrane fusion and viral entry. 
We next assessed the protective efficacy of 
the VNO1H1 and C77G12 mAbs in vivo. In the 
Syrian hamster model of SARS-CoV-2 P.1 


737 


12 AUGUST 2022 * VOL 377 ISSUE 6607 


RESEARCH | RESEARCH ARTICLES 


A B E 
ECs [ng/ml] @ SARS-CoV-2 a : 
3.2 = 1(x107) 4 3 2 1 (x10*) 4 3 2 1 (x107)3 2 1 (x10°) -——1 
=| _— @ SARS-CoV @0C43 @NL63 ae. ae 
@MERS-CoV @HKU1 © 229E ay 
C13B8 5 10") 2x6 
@. ®: @: @ 1074 6d 3 
C13A7 nee a ee ee E408 ¥ ve 
C13C9 ._ ° & He . 
C28F8 3 8 10° 
77612 ae oe Soi 
VNO1H1 o: e 
VP12E7 §e r e & 
» LQ SG SePF 9 Or Sr 9 i i i i : 
2@ Sr ew® O Ss 10° lI] rn or a a T 4 Ly rT Go 
FX PS DP oS > A % D 6B LD VAVN MA A 108 
S MF Q& Ss OX, BGS? of aK > aS SiOz Ra e 5 
& Cae SSP KY PMMA OS Oe 5 10 
RS rs? er Ne 2 10% 
Vv S Va \) S oO ©) © 
VES Re: £ YY eS an 
Ss se 8 3 10 
LISLE ° a 
oO 
C D a 
= 
5 SARS-CoV-2 
s ° o ©C13B8_CVNO1H1 
“ wn 7 
s 101 @ @ © © © © “(HEK293T-ACE2) P1007 Cigar” AVPIIET 
Bg a SARE OV? Ze _, |eciscs : 
B10? e e e @ = (HEK293T-ACE2-TMPRSS2) Pin 7 C28F8 5 
[o} 
25 . ‘ S 8 eSARs-cov 20. locr7e12 3 
22103 ‘ @ _ (HEK293T-ACE2-TMPRSS2) = < S 
®@ 5 CO = re) 
ee 4 ° oe @ MERS-CoV eS 254 £ 
8 104 ‘ (Huh7-TMPRSS2) at S 
a J 7) ‘3 
3 ; @NL63 a Oo 5 3 3 r 
6 101-4 1 1 (VeroE6-TMPRSS2) 3 et 
SF SS US S © 0220e * a 
iN N N V a‘ S* N e Ab | 
oe Oe wr (Huh7-TMPRSS2) mAb [egimil 
RY 
RS 


Fig. 2. Broadly reactive fusion peptide-specific mAbs acquire affinity and 
breadth through somatic mutations. (A) Binding profiles of the seven 

group small Group 3 mAbs. Epitope mapping was performed using commercially 
available SARS-CoV-2 protein domains (S;, RBD, and Sz), 15-mer overlapping 
synthetic peptides with the indicated sequences, prefusion HexaPro (F817P) S, 
prefusion PentaPro (F817) S, prefusion PentaPro (F817) Ss, and postfusion S. coated 
onto plastic and assessed by ELISA (logio ECso shown). The mAbs all recognized 
the fusion peptide sequence highlighted in blue. (B) Binding evaluation of 
group 3 mAbs and their respective germline reverted UCAs to different hCoV 

S proteins. C13B8 and C13A7 are sister clones and thus share a single UCA. 
One representative experiment of at least two is shown. (C) Half-maximum 
inhibitory concentrations (ICs50s) of group 3 mAbs against SARS-CoV-2, SARS- 
CoV, MERS-CoV, NL63, and 229E S-pseudotyped viruses in the indicated target 


cell lines. For each hCoV pseudotyped assay, all mAbs were compared in parallel. 
One representative experiment of at least two is shown. (D) Group 3 mAbs were 
assessed for their ability to inhibit the fusion of SARS-CoV-2 S-expressing 
(A549-S) and ACE2-TMPRSS2-expressing (A549-ACE2-TMPRSS2) A549 cells. 
Inhibition of fusion values are normalized to the percentage of fusion without 
mAb (100%) and to that of fusion of non-transfected cells (0%). One 
representative experiment of two is shown. (E) The prophylactic efficacy 

of VNO1H1 (50 mg/kg), C77G12 (25 and 50 mg/kg), and the negative control 
malaria-specific mAb MGH2 (50 mg/kg) were tested in hamsters challenged with 
the SARS-CoV-2 P.1 (Gamma) variant of concern. Viral RNA loads (top), 
replicating virus titers (middle), and histopathological scores (bottom) are 
shown. P > 0.05 (non-significant, n.s.), *P < 0.05, **P < 0.01, ***P < 0.001, 
Mann-Whitney test corrected using Bonferroni multiple comparisons. 


(Gamma) infection, prophylactic administra- 
tion of either mAb at high doses reduced viral 
RNA copies and lung titers and ameliorated 
lung pathology at statistically significant levels 
(Fig. 2E). These findings demonstrate that fu- 
sion peptide-specific mAbs can reduce viral 
burden in vivo, albeit with moderate potency. 


Structural basis for fusion peptide recognition 


To gain insight into the epitope recognized by 
fusion peptide-specific broadly reactive corona- 
virus mAbs, we performed substitution scan 
and structural analysis on the six clonally un- 
related mAbs. All mAbs bound to a core motif 
comprising I818EDLLFNK825 (fig. S6A), with 
C28F8, C77G12, VNO1H1, and VP12E7 having a 
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three-amino acid expanded footprint span- 
ning the N-terminal R815SF817 residues, which 
includes the S,' cleavage site at position R815. 
We determined the crystal structures of five 
Fabs (C13B8, C13C9, C77G12, VNO1H1, and VP12E7) 
in complex with the K811PSKRSFIEDLLFNK825 
fusion peptide at 2.1 A, 2.1 A, 1.7 A, 1.86 A, and 
2.5 A, resolution, respectively (Fig. 3; fig. S6, B 
to D; and table S3). All five Fabs bind to over- 
lapping epitopes in the fusion peptide through 
interactions involving the heavy- and light- 
chains. The strict conservation and the con- 
servative substitution of key residues involved 
in mAb recognition (R815, S816, 1818, E819, 
D820, L821, L822, F823, N824, and K825) 
across the Orthocoronavirinae subfamily ex- 


plains the broad cross-reactivity of these fusion 
peptide-specific mAbs (figs. $3 and S6 and 
table S4). The overall architecture of the fu- 
sion peptide in the C13C9-bound complex 
structure is most similar to that observed in 
the prefusion S trimer (PDB 6VXX) (4) (Fig. 3C 
and fig. S6B). In the two structures determined 
in complex with VNO1H1 or VP12E7, the 
SARS-CoV-2 S fusion peptide adopts a similar 
conformation, which is distinct from the con- 
formation observed in prefusion SARS-CoV-2 
S trimeric structures (4) (Fig. 3, B and C, and 
fig. S6C). Specifically, residues 813SKR815 
refold from an extended conformation in 
prefusion S to an o-helical conformation in 
the two Fab-bound peptide structures, thereby 
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extending the o-helix found at the N-terminal 
region of the fusion peptide. In the C77G12- 
bound complex structure, the fusion peptide 
residues P812 to R815 adopt an extended con- 
formation, distinct from prefusion S (Fig. 3, A 
and C), whereas these residues are disordered 
in the C13B8-bound structure (Fig. 3C and 
fig. S6D). The conserved residue R815, which 
is the S,’ site of proteolytic processing upon 
receptor binding for membrane fusion acti- 
vation, is engaged in electrostatic interactions 
with the C13C9, VNO1HI, VP12E7, and C77G12 
Fabs and is therefore buried at the interface 
with their paratopes (table S4). Because pre- 
incubation of a soluble, native-like SARS-CoV-2 
S ectodomain trimer with fusion peptide- 
specific mAbs did not prevent S2X58-induced 
triggering of fusogenic conformational changes 
(41, 42) (fig. 87), these mAbs likely inhibit 
TMPRSS2 cleavage of the S,' site (through 
steric hindrance) and in turn activation of 
membrane fusion. Although residue F817 is 
not part of the epitope (table S4), the F817P 
substitution present in the HexaPro construct 
likely prevents the adoption of the extended 
a-helical conformation observed in the struc- 
tures bound to VNO1H1 or VP12E7 (because of 
restricted backbone torsion angles), resulting 
in dampened binding in ELISA (Fig. 2A and 
fig. S2D). 

Superimposition of the Fab-fusion peptide 
complexes with available prefusion S structure 
(PDB 6VXX) (4, 5) revealed that the targeted 
epitope is buried toward the core of the S 
trimer and is therefore inaccessible (Fig. 3, D 
and E, and fig. S6, B to D), likely explaining 
the lack of detectable complexes of Fabs with 
prefusion S trimers during single-particle 
electron microscopy analysis. Taken together, 
these findings suggest that the epitope recog- 
nized by fusion peptide-specific mAbs is cryptic 
and may become accessible only transiently (43). 


The fusion peptide is unmasked by 
ACE2 binding 


To investigate the neutralization mechanism 
of fusion peptide-specific mAbs, we transfected 
human embryonic kidney 293T (HEK293T) 
cells to express coronavirus S proteins on their 
cell surface. All fusion peptide-specific mAbs 
showed only marginal binding to SARS-CoV-2 
S-expressing HEK293T cells compared with con- 
trol mAbs targeting the RBD (C94) (fig. S2C) or 
the stem helix (C21E3) (Fig. 4A and fig. S8A). 
However, the addition of soluble ACE2 enhanced 
binding of all fusion peptide-specific mAbs to 
SARS-CoV-2 S to levels comparable to that of 
the C94 and C21E3 control mAbs, suggesting 
that receptor engagement induces a conforma- 
tional change that exposes the cryptic fusion 
peptide epitope (Fig. 4, A and B, and fig. S8A). 
This ACE2-dependent enhancement of bind- 
ing was also observed with SARS-CoV-2 S 
harboring a deletion in the polybasic S,/S»_. 
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Fig. 3. Fusion peptide-specific antibodies target a cryptic epitope. (A and B) (Top) Surface representation of 
the crystal structures of the C77G12 (A) and VNO1H1 (B) Fabs in complex with SARS-CoV-2 fusion peptide 
epitope. (Bottom) Ribbon representation of the corresponding structures highlighting the interactions of 
Fab heavy- and light-chain complementarity-determining regions with the fusion peptide (selected regions 
are shown for clarity). (©) Ribbon representation of the fusion peptides in the Fab-bound complexes 
superimposed with the fusion peptide in prefusion SARS-CoV-2 S (PDB 6VXX). (D and E) Superimposition 
of the C77G12-bound (D) or VNO1H1-bound (E) fusion peptide structures to prefusion SARS-CoV-2 S 
uncovering the cryptic nature of the epitope. The Fabs are shown as surfaces, and S is rendered as ribbons. 
Each SARS-CoV-2 S protomer is colored distinctly (light blue, pink, and gold), and Fab heavy and light 
chains are colored purple and magenta, respectively. 
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Fig. 4. The SARS-CoV-2 fusion peptide is unmasked after ACE2 receptor locks the S trimer in a closed conformation. The line in the scatter plot is a 


engagement. (A) Binding of fusion peptide-specific mAbs VNO1H1 and C77G12 reference for the maximum geometric mean fluorescence intensity (gMFI) of 
(8 ug/ml), as well as RBD-specific C94 and stem helix-specific C21E3 mAbs staining with control anti-RBD C94 mAb. One representative experiment of two is 
(8 ug/ml) on HEK293T cells transiently cotransfected with plasmid encoding shown. (©) VNO1H1 and C77G12 in full-length IgG or scFv formats were tested 
ZsGreen and plasmids encoding full-length SARS-CoV-2 S Wuhan-Hu-1 for their neutralization of authentic SARS-CoV-2 Washington-1 and Omicron 
(GenBank: NC_045512), SARS-CoV-2 S-2P (K986P, V987P), NL63 S (GenBank: BA.1 and BA.2 variants. ICso values are displayed as a table. (D) Synergy 
APF29071.1), or 229E S (GenBank: APT69883.1) in the presence or absence experiment performed against SARS-CoV-2 (Wuhan-Hu-1) S-pseudotyped virus 
of respective recombinant receptors ACE2 or APN, as measured by flow (0.1 MO!) on VeroE6-TMPRSS2 at the indicated concentrations of S2E12 and 
cytometry. (B) Titrating doses of fluorescently labeled mAbs were coincubated C77G12 mAbs alone or in combination in a checkerboard manner. Analysis of 
with HEK293T cells expressing full-length SARS-CoV-2 S for 2 hours at room mAb synergy was performed using MacSynergy II (70). Synergy plots adjusted 
temperature in the presence or absence of recombinant ACE2-mFc (27 pg/ml), — with Bonferroni correction at 95% confidence were used for reporting. A synergy 
ACE2-mimicking mAb S2E12 (20 ug/ml), or S2M11 (20 pg/ml). The latter mAb score between 50 and 100 was defined as moderate. 


740 12 AUGUST 2022 + VOL 377 ISSUE 6607 science.org SCIENCE 


RESEARCH | RESEARCH ARTICLES 


cleavage site (APRRA) (44, 45) (fig. S8A), 
suggesting that these mAbs can bind before 
S, shedding or S fusogenic refolding to post- 
fusion state, consistent with the preferential 
binding to prefusion S, relative to postfusion 
S» (fig. S2A). However, the 2P (K986P, V987P) 
prefusion-stabilizing mutations (4, 5) that lie 
outside of the epitope (Fig. 4A and fig. S8A) 
inhibited ACE2-mediated enhancement of mAb 
binding, implying an impediment of receptor- 
induced allosteric conformational changes, 
consistent with recent findings (46). Enhanced 
binding of fusion peptide-specific mAbs was 
also observed with SARS-CoV S- and MERS- 
CoV S-expressing HEK293T cells in the pres- 
ence of their respective receptors ACE2 (47) 
and DPP4 (48). By contrast, fusion peptide- 
specific mAbs bound efficiently to HEK293T 
cells displaying the alphacoronaviruses NL63 
and 229E S independently of receptor engage- 
ment by ACE2 (49) or APN (50), respectively 
(Fig. 4A and fig. S8A). We observed that VNO1H1 
and C77G12 neutralized authentic SARS-CoV-2 
Omicron BA.1 and BA.2 more potently than 
Washington-l, likely because of increased acces- 
sibility of the fusion peptide in Omicron S (46). 
Furthermore, the neutralization potencies of 
VNO1H1 and C77G12 could be further improved 
by engineering mAbs with smaller formats (such 
as scFv), suggesting possible limited accessi- 
bility of the fusion peptide region (Fig. 4C). 

It was previously shown that certain mAbs 
can mimic ACE2 binding and trigger fusogenic 
activity (51, 52). Indeed, the addition of S2E12, 
an ACE2-mimicking mAb, but not S2M11, a 
mAb that locks SARS-CoV-2 S trimer in the 
closed state (57), enhanced binding of the 
fusion peptide-specific mAbs (Fig. 4B). Consistent 
with this finding, we observed a synergy in 
pseudovirus neutralization when C77G12 was 
used in combination with S2E12 (Fig. 4D), 
suggesting that fusion peptide-specific mAbs may 
be more effective in the context of a polyclonal 
antibody response against the RBD. Collec- 
tively, these results indicate that the fusion 
peptide epitope only becomes accessible upon 
receptor-induced S conformational changes in 
sarbecovirus SARS-CoV-2, SARS-CoV, and merbe- 
covirus MERS-CoV, whereas it is more readily 
accessible in the alphacoronaviruses NL63 and 
229E, possibly because of molecular breathing 
to different extents in these S trimers. 


Discussion 


Using a high-throughput screening of memory 
B cells from immune donors with a panel of 
S proteins, we identified a new class of mAbs 
that target the fusion peptide region. These 
mAbs have different binding modalities, some 
of which exhibit unprecedented cross-reactivity 
to the fusion peptide of all four coronavirus 
genera. The findings that the UCAs of some of 
these mAbs bind preferentially to common 
cold coronaviruses and that the mAbs acquire 
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affinity and breadth through somatic muta- 
tions suggest that their elicitation may re- 
quire multiple rounds of selection, possibly by 
heterologous coronavirus infections. A complex 
developmental pathway has been reported for 
HIV-1-neutralizing mAbs and may be a general 
requirement for mAbs that recognize highly 
conserved epitopes (53, 54). 

Previous studies have identified serum anti- 
bodies binding to the fusion peptide of SARS- 
CoV-2 and showed, through depletion or 
peptide inhibition experiments, that such 
antibodies can contribute to the serum- 
neutralizing activity in a polyclonal setting 
(34-38). We show here that some fusion 
peptide-specific mAbs have direct neutraliz- 
ing activity in vitro against alpha- and beta- 
coronaviruses and can ameliorate pathology 
and viral burden in vivo at high doses. Al- 
though the neutralizing activity of these mAbs 
is low when used alone, it is possible that in 
the context of a polyclonal response, they may 
synergize with other antibodies that favor the 
exposure of the fusion peptide region, as 
shown here with S2E12. 

The biochemical events associated with S,' 
proteolytic processing for SARS-CoV, SARS- 
CoV-2, and MERS-CoV that are critical for 
unmasking the fusion peptide region have 
been described previously (55-59). The mAbs 
isolated in this study provide tools to charac- 
terize allosteric conformational changes in S 
triggered by receptor binding or by receptor- 
mimicking mAbs (5/, 52), a process that has 
thus far eluded structural definition. These 
events lead to exposure of the fusion peptide 
region for proteolytic cleavage by TMPRSS2 
or endosomal cathepsins preceding membrane 
fusion (32). The finding that this region is 
readily accessible in alphacoronaviruses is 
consistent with the higher neutralizing activity 
of some fusion peptide-specific mAbs against 
NL63 and 229E and proposes that conforma- 
tional changes might occur spontaneously as a 
consequence of molecular breathing. 

Broadly neutralizing mAbs recognizing 
the fusion peptide region of influenza virus 
hemagglutinin and HIV-1 gp120 have guided 
the design of universal vaccine candidates against 
these highly variable pathogens (60-66). Sim- 
ilarly, the conservation of the fusion peptide 
region across the Orthocoronavirinae sub- 
family provides a rationale to develop a vac- 
cine capable of inducing fusion peptide-specific 
responses that may be less prone to viral escape 
because of the potential fitness loss of escape 
mutants (67, 68). The finding that prefusion 
stabilizing 2P mutations (4, 5) abolish receptor- 
induced exposure of the fusion peptide suggests 
that current mRNA vaccines may be suboptimal 
in inducing antibody responses against this 
region. The broadly neutralizing coronavirus 
mAbs described in this study may be used as 
probes to design immunogens that can better 


unmask the fusion peptide region and elicit 
broadly protective antibody responses. Nota- 
bly, the fusion peptide region was also found 
to stimulate broadly reactive CD4 T cells (69), 
providing a cue for intramolecular help in the 
generation of such antibodies. In conclusion, 
our study identifies neutralizing mAbs with 
unprecedented breadth and sheds light on a 
receptor-triggered conformational change that 
limits the immunogenicity of this conserved 
region, with potential impacts on universal 
coronavirus vaccine design. 
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Proton-coupled energy transfer in molecular triads 


Belinda Pettersson Rimgard'{, Zhen Tao“++, Giovanny A. Parada”, Laura F. Cotter’, 
Sharon Hammes-Schiffer?*, James M. Mayer@*, Leif Hammarstrém* 


We experimentally discovered and theoretically analyzed a photochemical mechanism, which we 
term proton-coupled energy transfer (PCEnT). A series of anthracene-phenol-pyridine triads formed 
a local excited anthracene state after light excitation at a wavelength of ~400 nanometers (nm), 
which led to fluorescence around 550 nm from the phenol-pyridine unit. Direct excitation of phenol- 
pyridine would have required ~330-nm light, but the coupled proton transfer within the phenol- 
pyridine unit lowered its excited-state energy so that it could accept excitation energy from 
anthracene. Singlet-singlet energy transfer thus occurred despite the lack of spectral overlap 
between the anthracene fluorescence and the phenol-pyridine absorption. Moreover, theoretical 
calculations indicated negligible charge transfer between the anthracene and phenol-pyridine units. 
We construe PCEnT as an elementary reaction of possible relevance to biological systems and 


future photonic devices. 


oupling of proton transfer to electronic 

transitions exerts a great influence on 

the energetics and kinetics of these 

processes. Photoacids are well-studied 

examples, where light excitation leads 
to arapid proton release and, in some cases, 
even produces long-lived pH changes and pH 
gradients (/, 2). Another example is excited- 
state intramolecular proton transfer (ESIPT), 
which generates an excited tautomer state of 
the light-absorbing molecule; systems where 
this state is fluorescent have applications for 
optoelectronic materials (3). Often, ESIPT 
instead allows for ultrafast dissipation of 
excess electronic energy as heat, and is there- 
fore an important photostabilizing mecha- 
nism in plastics (4) as well as in biomolecules 
such as DNA (5). 

More recently, the charge transfer mecha- 
nism of proton-coupled electron transfer (PCET) 
has generated great interest (6). PCET is a key 
reaction behind biological energy conversion 
in photosynthesis, respiration, and nitrogen 
fixation (7), as well as in synthetic chemical 
systems for photoredox catalysis and renewable 
energy conversion and storage (8-11). It is also 
involved in DNA synthesis and repair, photo- 
receptor function, and many other radical en- 
zyme reactions (12-15). Of particular interest 
is the concerted PCET mechanism, where the 
electron and proton are transferred without 
the formation of high-energy intermediates, 
thereby offering a low-barrier reaction pathway. 
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In all the above reactions, the coupled trans- 
fer of the proton changes the energy landscape 
of the overall process, which obviously affects 
its dynamics and rate. There are also important 
effects of the much greater mass of the proton, 
whose wave function is more localized than 
that of the electrons involved, which may 
impose a greater sense of directionality on the 
process. Understanding and analyzing such 
effects, theoretically and experimentally, is a 
fundamentally interesting challenge and of 
practical importance. 

Tn 2019, our groups used a series of anthracene- 
phenol-pyridine (An-PhOH-Py) triads (Fig. 1) (6) 
to demonstrate the first example of concerted 
PCET occurring in the Marcus inverted region 
(17). This behavior, where the rate decreases 
with increasing driving force, is important for 
slowing recombination after photochemical 
charge separation, but was previously consid- 
ered unlikely for concerted PCET reactions 
because of the nearly barrierless tunneling to 
higher vibrational product states (78). None- 
theless, photoexcitation of the An subunit of 
the triads results in the formation of a local 
excited state (LES) on the anthracene (Eq. 1) 
(16, 19). In tenths of picoseconds, the LES 
forms the charge-separated state (CSS) by elec- 
tron transfer from the PhOH group to the “An 
concerted with proton transfer to the pyridine 
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Fig. 1. Structures of the studied anthracene- 
phenol-pyridine ([An-PhOH-py]) triads. The enol 
form is the stable ground-state form, and the 
electronically excited keto form is the local electron- 
proton transfer (LEPT) state. 
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(Eq. 2). The charges subsequently recombine to 
the ground-state reactants (Eq. 3), once again 
in a concerted electron-proton transfer reac- 
tion but with a large driving force (>2.0 eV in 
CH;CN). This process consequently places the 
reaction in the inverted region, as was shown 
by the rate dependence on driving force and 
solvent polarity. 


An-PhOH-py —“> !*An-PhOH-py(LES) (1) 


1 An-PhOH-py—An*~-PhO*-pyH* (CSS) (2) 


An* -PhO"-pyH*—An-PhOH-py (3) 
The CSS intermediate was observed only in 
triads 1 to 3, however; in triads 4 to 8, a 
similar quenching of the LES did not result in 
an observable CSS (16). This result is puzzling, 
as the charge recombination driving force is 
larger for 4 to 8, meaning the reaction would 
be more deeply in the inverted region and 
thus even slower than for 1 to 3. Theoretical 
investigations have suggested the involve- 
ment of another state that would lie lower 
than the CSS in 4 to 8 (20, 27). This is the so- 
called local electron-proton transfer (LEPT) 
state, where the PhOH-py fragment is in a 
local, electronically excited tautomeric (keto) 
state, *[PhO-pyH], in which the phenolic proton 


Fig. 2. Steady-state A 
absorption (room 

temperature) and fluores- 
cence (77 K) spectra in 
butyronitrile. (A) Sche- 

matic Jablonski diagram 

of the possible transitions 

at 77 K for the triads: 

radiative decay (downward 

solid line) and nonradiative 
decay. The CSS energy 

depends strongly on 

the chemical environment 

(16, 19) and is higher Cc 
than the LES at 77 K. 

(B) Absorption (solid line) 
and fluorescence spectra 
normalized to the first 

An emission peak (dashed 
liné, Aexc: 400 nm) of 

1 (black) and the reference 
compounds of similar 
concentrations (4 to 7 uM): 
2,4-di-tert-butyl-6-(pyridin- 
2-yl)phenol (PhOH-py, 
blue) and 9-cyano-10-meth- 


1.6x10° + 
1.2x10° + 
8.0x10’ + |. 


4.0x10’ 5 


Emission Intensity (counts) 


0.0 


has transferred to the pyridine. The LEPT 
state is the same excited tautomeric state as 
that formed by ESIPT after excitation of the 
separate phenol-pyridine compound (22), and 
similar to other ESIPT compounds (3). This 
state was proposed to be populated after ex- 
citation in 4 to 8 (20, 21); it can possibly be 
populated from the CSS (Eq. 4) or be formed 
directly from the LES (Eq. 5) by a process 
here denoted proton-coupled energy trans- 
fer, PCEnT (see below). 


An‘~-PhO*-pyH* —An- *[PhO-pyH](LEPT) 
(4) 


1*An-PhOH-py —> An-*{PhO-pyH](LEPT) 


(5) 


Here, we present experimental and theoretical 
evidence for direct formation of the fluores- 
cent LEPT state from the LES at 77 Kina 
butyronitrile glass, in which formation of the 
CSS is inhibited. In this experimental example 
of PCEnT, excitation energy was transferred 
from **An to *[PhO-pyH], coupled to proton 
transfer within the PhOH-py unit (Eq. 5). This 
finding was surprising, as it is a singlet-singlet 
transition without any detectable spectral over- 
lap between the donor fluorescence and accep- 
tor absorption. Moreover, PCEnT occurred with 
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ylanthracene (CN-MeAn, 
green). (C) Fluorescence 
spectra of triads 1, 

2, 4, 5, 6, and 7 with 


400 450 500 550 600 650 700 750 
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negligible charge transfer between the An and 
PhOH-py units, which is in sharp contrast to 
PCET reactions. This result significantly ex- 
pands the scope of concerted processes where 
proton transfer controls reactivity. Many excited- 
state charge and energy transfer reactions in 
proteins and DNA occur between hydrogen- 
bonded pigments, making PCEnT possibly im- 
portant in natural systems. Yet such a reaction 
has, to the best of our knowledge, not pre- 
viously been reported. 


Spectroscopic observation of LEPT state 


The lack of observed CSS formation in triads 
4 to 8, together with the computationally 
predicted excited-state keto tautomer (LEPT) 
(20), laid the basis for further investigation. 
The triad absorption spectra agree well with a 
sum of contributions from the separate com- 
ponents: a vibrationally structured anthracene 
band around 400 nm and a PhOH-py band 
around 330 nm (Fig. 2B and fig. S1). The LEPT 
state should fluoresce around 500 nm in 
aprotic, rigid media, as has been shown for the 
excited keto state, *[PhO-pyH] of the separate 
phenol-pyridine compound (22). Therefore, we 
examined the triads (1, 2, 4 to '7) by fluores- 
cence spectroscopy in rigid butyronitrile glass 
at T = '77 K. We also studied these triads com- 
putationally and found that the minimum- 
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excitation at 400 nm (1 and 2), 365 nm (6), and 375 nm (5 and 7). Note that 7 was measured with twice the absorption at the excitation wavelength. (D) Absorption 
spectrum (black line), fluorescence excitation spectrum (Aem = 455 and 550 nm, solid line) and fluorescence spectrum (Aexe = 400 nm, dashed line) of 2. 
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energy LEPT geometry exhibits a ~90° twist 
between the phenol and pyridine fragments, 
whereas the planar LEPT geometry is a first- 
order saddle point. This twisting, which is 
known to quench the fluorescence (22, 23), 
is expected to be hindered in the glass at 77 K. 
Thus, the computational studies were per- 
formed for both the planar and twisted LEPT 
geometries. 

The 77 K fluorescence spectrum of 1, after 
selective excitation of the An unit at 400 nm, 
showed a structured ‘An fluorescence at 420 
to 580 nm, accompanied by a broad band with 
a maximum around 550 nm (Fig. 2B). The 
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latter is expected for the LEPT state (22) and 
was similarly observed for all measured triads 
with an electron-donating group on the pyridine: 
1, 2, 5, and 6 (Fig. 2, B and C). The LEPT emission 
shifted to lower energy when the electron- 
donating group was weaker (2 > 1 > 6), con- 
sistent with the computational results in 
tables S8 and S12. Because 1 and 5 share the 
same functional group (R, = 4-Me) on the 
pyridine, the LEPT energy should be similar. 
Still, the peak for 5 was blueshifted by 11 nm, 
which indicated a small but significant cou- 
pling to the An unit, which differs in its func- 
tional group (CN in 1; H in 5) between these 
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triads. The relative intensity of the “An and 
LEPT fluorescence was independent of triad 
concentration (fig. S2), and control experi- 
ments with 9-CN-10-Me-An showed only the 
structured ‘*An fluorescence (Fig. 2B). This 
excluded any excimer or other anthracene 
complex as responsible for the broad fluores- 
cence around 550 nm. 

Fluorescence excitation spectroscopy pro- 
vided unambiguous evidence for LEPT state 
formation from excited *An (Fig. 2D). The 
shape of the excitation spectrum of the 550-nm 
LEPT emission peak (purple line) matched that 
of the absorption spectrum (black line), with 
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Fig. 3. Transient fluorescence emission spectra in butyronitrile at 77 K. 
(A) Contour map of the transient fluorescence emission intensity counts (dark 
blue = highest intensity) of 6 in butyronitrile (0.2 mM) between 390 and 

640 nm within a 2-ns time window. (B) Emission spectra at given times after 
excitation with 370-nm light. The **An fluorescence reaches a maximum at 


*PhOH-py 


Twist 


Fig. 4. Potential energy surfaces and optimized structures for the electronic 
states discussed. (A) Schematic potential energy surfaces for the ESIPT and 
PCEnT reactions, color-coded to mark the electronically excited fragment 

(green, An; purple, PhOH-py). Note that ESIPT is barrierless on the excited-state 
potential energy surface (purple), whereas PCEnT follows the reorganization 

of the heavy nuclei including the solvent (green to purple). In fluid solution, the 
LEPT state will twist along the PhOH-py bond, where 90° leads to a conical 
intersection, and nonradiative decay. This twist is impeded in the 77 K butyronitrile 
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~30 ps (instrumental response function IRFewyy = 65 ps) and decreases 
simultaneously with the the rise of LEPT fluorescence that maximizes after 
~140 ps. (C) Time-resolved fluorescence traces (lines, fit; circles, data) within 
the LES (445 nm, blue line) and LEPT (570 nm, green line) bands; the IRF is 
shown as a dashed line. 


glass, although a partial twist could occur to lower the energy of the LEPT state 
relative to the LES (table S11). GS, ground state. (B) The optimized structures and 
the natural bond orbital [NBO (29)] charges on the anthracene (blue), phenol 
(red), and pyridine (purple) subunits of 1 for the GS, LES, CSS, and planar (not 
twisted, first-order saddle point) LEPT state in the gas phase. The charges reflect 
the electron density, where a more negative number equals a more electron-dense 
fragment (the methylene spacer between anthracene and phenol is not included in any 
of the fragments, and therefore those charges are excluded in this analysis). 
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contributions from both the PhOH-py (330 nm) 
and the An (300 to 430 nm) absorption bands. 
The excitation spectrum of the 450-nm “An 
emission (green line) showed instead a weaker 
relative contribution from the 330-nm PhOH-py 
band. The 550-nm LEPT state could thus be 
formed either by exciting the PhOH-py unit, 
which would cause formation of the LEPT 
state by the well-established ESIPT mecha- 
nism, or directly via excitation of the An unit 
followed by energy transfer from An. 


Evidence for proton-coupled energy 
transfer mechanism 


The excitation energy transfer from the an- 
thracene donor to the phenol-pyridine unit 
to yield the LEPT state is a type of mecha- 
nism that we denote proton-coupled energy 
transfer, PCEnT (Fig. 2A). Energy transfer di- 
rectly to the *[PhOH-py] state would be strongly 
uphill, as is seen from its low-wavelength ab- 
sorption spectrum around 330 nm, but proton 
transfer lowers the phenol-pyridine excited-state 
energy to make the PCEnT process energetically 
feasible. The quantum yield for the LEPT for- 
mation was deemed to be similar irrespective 
of which fragment was excited, on the basis of 
the good agreement between the shapes of the 
absorption spectrum (Fig. 2D) and the 550-nm 
excitation spectrum, where the respective 
PhOH-py and An amplitudes were fairly equal 
in the two spectra. This fact shows that PCEnT 
from “An was about as efficient as ESIPT from 
*[PhOH-py]. 

To follow the formation and decay of the 
LEPT state in real time, we performed ultra- 
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fast transient fluorescence spectroscopy mea- 
surements of 6 with excitation at 370 nm, in 
the mid- to red part of the An absorption band 
(Fig. 3). The observed features matched those 
seen in the steady-state experiments, and global 
fits were performed within the peak regions 
of the LES (380 to 490 nm) and LEPT (520 to 
655 nm) fluorescence (Fig. 3B). The rise time 
of the LEPT peak was fitted to ~44 ps and a 
similarly short decay component could be 
found for the LES (46 ps), which reinforced the 
notion that the LEPT state could be populated 
directly from the LES (as depicted in the overall 
reaction scheme in Fig. 2A). 

We can exclude LEPT formation via a sequen- 
tial LES-CSS-LEPT reaction at 77 K, because 
the CSS would be substantially destabilized in 
asolvent glass (Fig. 2A) as a result of its greater 
dipole moment relative to both the LES and 
LEPT (Fig. 4B). Moreover, the solvent glass at 
77 K prevented the large degree of twisting of 
the An group to reach the CSS optimized 
structure (Fig. 4B and table S1), which further 
destabilized the CSS energy. The CSS lies less 
than 0.2 eV below the LES at room temper- 
ature (16), and small-molecule charge separa- 
tion is typically destabilized by more than 
0.5 eV by freezing out solvent repolarization 
in polar solvents (24). Our gas-phase calcula- 
tions in table S6 indeed placed the CSS higher 
than the LES. Moreover, the LEPT fluores- 
cence was formed faster in 6 (~44 ps) than in 
1 (~69 ps), despite greater room-temperature 
driving force for CSS formation (by ~0.5 eV) in 
the latter (16); this is clear experimental proof 
against a reaction involving the CSS. 
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General reaction coordinate 


The photophysics of triads 1 to 8 have been 
previously studied by transient absorption (TA) 
at ambient temperature (16), and 6 was also 
examined by time-resolved fluorescence spec- 
troscopy at various temperatures (25). In the 
TA study at room temperature, the initial an- 
thracene LESs in 1 to 3 decayed to a long-lived 
state (up to t = 755 ps) with spectral features 
in agreement with the CSS being formed by 
PCET, including obvious features of the an- 
thracene anion. The same mechanism was 
assigned to the decay of the LES of 4 to 8 on 
the basis of the close correlation of LES decay 
rates with the free energies of PCET for each 
triad. These triads did not show a long-lived 
transient in the TA spectra at ambient tem- 
peratures. However, computations later placed 
the LEPT state below the CSS for 4 and 6 (20), 
indicating that rapid CSS-to-LEPT conversion 
and/or a contribution of PCEnT from the LES 
would be reasonable. 

The previously reported spectrally resolved 
ultrafast fluorescence spectra of 6 at ambient 
temperatures showed an H/D kinetic isotope 
effect (KIE) = 4 and did not show any evidence 
of emission from a LEPT state. At lower tem- 
peratures, in a 2-methyl-tetrahydrofuran 
(2-Me-THF) glass or a polymethylmethacrylate 
(PMMA) film, the subnanosecond integrated 
fluorescence followed a single exponential with 
t= 150 ps, with an unexplained long-time scale 
(>> ns) component. The data in 2-Me-THF 
were obtained in the presence of 3% methanol, 
which was observed to quench the LEPT emis- 
sion in the current study. Therefore, the two 
studies are not directly comparable. In retrospect, 
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Fig. 5. Avoided-crossing region between the S, and S2 adiabatic electronic 
states for triad 6, corresponding to a crossing between the LES and LEPT 
diabatic electronic states, and analysis of the S; excited state for relevant 
configurations. The energy curves were obtained from an excited-state adiabatic 
molecular dynamics trajectory and are plotted along a unitless general reaction 
coordinate rather than time; this trajectory is far from equilibrium and is not directly 
comparable to experiment because of the reaction conditions (supplementary text). 
(A to C) Configurations obtained from this trajectory. The dominant natural 
transition orbitals (NTOs), corresponding to the transitions from occupied to 
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virtual orbitals that characterize the S; excited state, are shown for these 

three configurations. For (A), the NTOs are localized on the anthracene (LES), whereas 
for (C), the NTOs are localized on the phenol-pyridine fragment (LEPT state), 

where the proton has mostly transferred. For (B), an approximately equal mixture 
of LES and LEPT is observed, as indicated by the two NTOs. Above the orbital 
drawings of each configuration, the numbers are the charges (in a.u., the elementary 
charge of an electron) of the anthracene, phenol, and pyridine fragments for the S; 
state, in that order (excluding the charge of the CH bridging group and the 
transferring hydrogen, which has a charge of ~0.5 a.u.). 
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these data could be consistent with the LES 
decaying either by PCET or by the new PCEnT 
pathway reported here in butyronitrile glasses; 
additional studies are needed. 


Physical principles underlying proton-coupled 
energy transfer 


The observation that PCEnT directly formed 
the excited tautomeric LEPT state was sur- 
prising and needs mechanistic clarification. As 
both the “An and the LEPT state are fluores- 
cent, PCEnT is clearly a singlet-singlet process. 
However, there was no detectable spectral 
overlap between the donor fluorescence and 
acceptor ground-state absorption (Fig. 2B), 
which is a requirement for Forster (dipole- 
dipole) energy transfer. The absence of low- 
energy absorption of the keto form [PhO -pyH*] 
is because the ground state, at the LEPT ge- 
ometry, lies ~0.5 eV higher than the enolic 
ground state (table S4) and was thus not 
populated to any appreciable extent. Instead, 
as the reaction occurred at quite a small 
donor-acceptor distance (~6 A between the 
central rings of An and PhOH), the coup]- 
ing was presumably dominated by exchange 
and/or penetration terms (26). Any stron- 
ger coupling mechanism was probably pre- 
vented by the methylene spacer between the 
subunits. 

We propose, therefore, that PCEnT could be 
described by a nonadiabatic surface crossing 
(Fig. 4A), in analogy to Marcus-type theoretical 
descriptions of PCET (6, 7). Thermal fluctua- 
tions of heavy nuclei lead to a configuration 
where electronic energy transfer and proton 
tunneling occur with energy conservation 
(crossing point of the reactant and product 
free energy curves). It is different from PCET, 
however, in that there is no charge transfer 
between donor and acceptor. The product 
state is a fluorescent excited state when phenol- 
pyridine twisting is hindered, and can thus be 
readily detected. Moreover, this mechanism 
differs from an ESIPT reaction, where proton 
transfer is driven by intramolecular charge 
redistribution, and the electronic coupling is 
usually strong, which leads to ultrafast reac- 
tions. The PCEnT reaction described here was 
instead driven by a remote, weakly coupled 
electronically excited state. 

The rearrangement from the enol to the keto 
form of the phenol-pyridine acceptor unit is 
much greater than just the transfer of a proton, 
as it involves substantial charge redistribution 
and changes in bond lengths and angles. These 
combined effects change the excitation energy 
of the phenol-pyridine acceptor unit and allow 
it to come into resonance with that of the 
anthracene. This behavior is the fundamental 
basis for coupling of proton transfer to energy 
transfer and distinguishes PCEnT from typi- 
cal energy transfer reactions. The details of 
this coupling and the role of coupling with 
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different proton vibrational states remain to 
be further explored. 

Figure 5 demonstrates the PCEnT process 
observed in a molecular dynamics trajectory 
for triad 6 propagated on the S, state, which 
changed from LES to LEPT character as the 
proton transferred from the phenol to the pyr- 
idine. Representative configurations along this 
trajectory, along with an analysis of the frag- 
ment charges and dominant natural transition 
orbitals, show that the character of the S, state 
changed from LES to a mixture of LES and 
LEPT and then to LEPT. The nonadiabatic 
coupling between the S, and S, adiabatic 
electronic states (27, 28) exhibited a peak at 
the avoided crossing, providing evidence that 
this process is nonadiabatic. Throughout this 
process, the charge on the anthracene fragment 
remained virtually zero, indicating that the 
LES-to-LEPT transition did not involve charge 
transfer between anthracene and the phenol- 
pyridine group, consistent with the PCEnT 
mechanism. Moreover, the net charge trans- 
fer from the phenol to the pyridine was also 
nearly zero after accounting for the charge of 
the transferring proton. In the framework of 
the vibronically nonadiabatic PCET theory 
(6), the transferring proton would be treated 
quantum mechanically and would tunnel 
during the LES-to-LEPT transition. 


Conclusions and outlook 


We have experimentally proved the involve- 
ment of the excited *[PhO-pyH] (LEPT) state, 
in the photochemical reactions of these triad 
molecules. This mechanism competes with 
PCET from the LES and could explain why 
the CSS was not observed in 4 to 8. The LEPT 
state was formed by a novel proton-coupled 
energy transfer (PCEnT) mechanism. This is 
a singlet-singlet exchange energy transfer, 
coupled to proton transfer in the acceptor 
unit, that can be described as a nonadiabatic 
transition to the excited product state. Its free 
energy barrier in turn is determined by the 
reorganization energy associated with the 
solvent and other heavy nuclei, similar to 
Marcus ET theory or PCET theories (6, 17). In 
contrast to PCET, there is negligible charge 
transfer between donor and acceptor. 

This mechanism can have several intriguing 
implications for both natural and synthetic 
systems. PCEnT may allow for the photosen- 
sitization of otherwise dark states, such as 
seen here by the LEPT formation from the low 
energy excitation of the LES, rather than the 
direct excitation of the PhOH-py unit at higher 
energy (ca 330 nm). Moreover, as energy trans- 
fer occurs over long distances but proton 
transfer is short-range, we suggest that this 
mechanism may be of importance to direct 
energy migration in e.g., light-harvesting sys- 
tems and photonic materials, where the PCEnT 
acceptor would form a low-energy site. PCEnT 


could also form the basis of a molecular switch 
for energy transfer, where proton transfer 
could be controlled independently by hydrogen- 
bonding or (de)protonation, thus effectively 
turning energy transfer on or off. We speculate 
that these processes could already be oper- 
ational in natural systems with hydrogen- 
bonded light-absorbers, such as porphyrins in 
photosynthetic assemblies, DNA and flavin 
proteins, but have yet to be identified. 
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Synthetic genetic circuits as a means of 


reprogramming plant roots 
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The shape of a plant’s root system influences its ability to reach essential nutrients in the 

soil and to acquire water during drought. Progress in engineering plant roots to optimize 

water and nutrient acquisition has been limited by our capacity to design and build genetic 
programs that alter root growth in a predictable manner. We developed a collection of synthetic 
transcriptional regulators for plants that can be compiled to create genetic circuits. These circuits 
control gene expression by performing Boolean logic operations and can be used to predictably alter 
root structure. This work demonstrates the potential of synthetic genetic circuits to control gene 
expression across tissues and reprogram plant growth. 


ecent advances in our understanding of 

the molecular mechanisms driving plant 

development have identified key regu- 

lators of agronomic traits, but our limited 

ability to control gene expression in 
plants is a barrier to applying this knowledge 
(, 2). Indeed, it remains challenging to express 
genes in specific patterns in plants, especially 
if those patterns are outside the reach of 
native promoters (3-6). One attractive solu- 
tion is synthetic genetic circuits, which have 
been constructed in a wide array of prokary- 
otic organisms and eukaryotic cell lines (7-10). 
However, circuit technology has been difficult 
to implement in plants because of the long 
time scales required for producing transgenic 
lines and the difficulty of tuning circuit activ- 
ity across heterogeneous cell types (77). Here, 
we show that quantitative transient expres- 
sion assays can be used to aid the construction 
of circuits in plants by providing a platform for 
testing and tuning circuit designs. We show 
how these assays can be leveraged to improve 
the performance of circuits designed to pro- 
duce specific spatial patterns of gene expres- 
sion across root tissues, and we use one of 
the circuits to precisely modify plant root 
architecture. 
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Constructing synthetic genetic circuits 

in plants 

To construct genetic circuits in plants, we first 
generated a collection of synthetic transcrip- 
tional regulators. We designed simple tran- 
scriptional activators [composed of bacterial 
DNA binding proteins, VP16-activation do- 
mains, and SV40 nuclear localization signals 
(NLSs)], and synthetic repressors that rely on 
steric hindrance to achieve repression (com- 
posed of only DNA binding proteins and NLSs) 
(Fig. 1, Aand B) (12-16). Similar to previous 
synthetic promoter designs, activatable plant 
promoters were created by fusing six copies of 
the DNA sequence (operator) bound by these 
transcription factors (TFs) to a minimal plant 
promoter [positions -66 to +18 of the cauli- 
flower mosaic virus (CaMV) 35S promoter] 
(Fig. 1A) (77, 18). Repressible promoters were 
built by placing one operator sequence at the 
3’ end of a full-length CaMV 35S promoter 
(Fig. 1B) (19). This design was selected to avoid 
disrupting 35S promoter activity when adding 
operators. 

Synthetic TF activity was measured in plants 
by delivering DNA encoding the proteins and 
their target promoters to Nicotiana benthamiana 
leaves using Agrobacterium tumefaciens. To 
enable quantitative measurements of gene ex- 
pression, we used the synthetic promoters to 
drive expression of green fluorescent protein 
(GFP) and normalized GFP expression to a con- 
stitutively expressed mCherry encoded on the 
same T-DNA (see supplementary materials). 
We also introduced an intron to GFP to prevent 


Agrobacterium from expressing the reporter 
and confounding fluorescence measurements 
(fig. S1). Nine of the 10 synthetic activators 
were able to turn on their target promoters 
in N. benthamiana (factor of 3 to 45 increase 
relative to promoter-only controls), and seven 
of these were specific to the promoters they 
were designed to regulate (Fig. 1, C and D). 
Specificity is needed to construct circuits free 
of cross-talk between components (20). In con- 
trast to the activators, only four of the synthetic 
repressors generated more than a factor of 2 
change in gene expression, and none achieved 
complete repression (Fig. 1, E and F). Thus, 
many of the repressors required further opti- 
mization before they could be used to construct 
circuits. 

To achieve greater control over expression, 
we modified the synthetic TFs. Replacing the 
human herpesvirus-derived VP16 activation 
domain with the activation domain from 
Arabidopsis ETHYLENE RESPONSE FACTOR 
2. (ERF2“”) improved activity of the PhIF-based 
synthetic activator by nearly a factor of 6 (Fig. 
1G) (21). A small fragment of ERF2 (78 amino 
acids) was used to reduce the possibility of 
regulation by native plant components, such 
as small RNAs, kinases, etc. (27). In contrast to 
the activators, adding a repressor domain to 
the AmtR-based synthetic repressor either 
had no effect [TOPLESS (TPL®”)] or only 
modestly increased repression (relative to 
NLS only) by 20% [ETHYLENE RESPONSE 
FACTOR 4 (ERF4®?)] or 33% [SALT TOL- 
ERANCE ZINC FINGER (ZAT10®”)] (Fig. 
1H) (22, 23); thus, swapping the activation 
domain led to increased expression, whereas 
adding repressive domains had a more modest 
effect. 

We also engineered the synthetic promoters 
to tune expression levels and increase dynamic 
range. Changing the number of TF binding sites 
in the AmtR-activatable promoter resulted in 
a collection of promoters of varying strength. 
Promoters with three or more operators am- 
plified expression relative to the constitutive 
G1090 promoter used to drive expression of 
AmtR-VP16 (Fig. 11, compare 6x to 3x with 
control) (24). In contrast, promoters with two 
operators roughly replicated G1090’s expres- 
sion level and one operator produced less 
expression (Fig. 1I). These data suggest that 
synthetic promoters can be used to tune gene 
expression levels by either amplifying or 
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Fig. 1. Basic building blocks for constructing synthetic genetic circuits. 
(A and B) Schematics of the synthetic transcriptional activators (A) and 
repressors (B) built to control gene expression in plants. Small bent arrow 
denotes the transcription start site. Activity (C) and specificity (D) of the 
synthetic activators. Yellow box highlights orthogonal activators. (E) Activity 
of the synthetic repressors. (F) Fluorescence of leaves infiltrated with a 
plasmid that does not encode GFP. (G) Comparison of PhiF-based activators 
containing either the VP16 (V16) or ERF2 (E2) activation domain. (H) Comparison 
of AmtR-based repressors containing either no repressive domain or a 
repressive domain from TOPLESS (TPL), ZAT10 (Z10), or ERF4 (E4). 

Control (—) is promoter alone. (I) Activity of synthetic activatable promoters 


dampening the transcriptional signal produced 
by an input promoter. 

In line with a previous study, we found that 
moving the location of the operator to a region 
between the TATA and CAAT boxes within 
the 35S promoter improved repression and 
dynamic range (Fig. 1J) (0). Adding a second 
operator between the TATA box and the tran- 
scription start site further improved repression 
and dynamic range (Fig. 1J). However, adding 
a third binding site at the 3’ end of the 
promoter reduced its ON state without signif- 
icantly reducing the repressed state, resulting 
in a smaller dynamic range than the two- 
operator promoter (Fig. 1J, 58x versus 64x). 
These design features were transferrable to 
other synthetic repressor-promoter pairs (fig. 
$2), demonstrating that the number and loca- 
tion of operators can be adjusted to tune pro- 
moter activity. 

Using these synthetic regulators, we con- 
structed circuits that perform Boolean logic 
operations. Synthetic TFs built with the AmtR 
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and PhIF DNA binding proteins, which dem- 
onstrated strong activation and repression in 
our initial TF designs, served as the inputs to 
all circuits, and GFP served as the output. Cir- 
cuit activity was measured in N. benthamiana 
leaves, which were infiltrated with multiple 
Agrobacterium strains, each containing one 
plasmid that encoded either an input TF or 
the output (Fig. 2, A and B). 

Computation was performed by synthetic 
promoters that responded to the input TFs in 
unique ways. Simple gates, like the A and B 
BUFFER gates, used synthetic promoters that 
responded to one input TF; more complex gates 
required synthetic promoters that responded to 
multiple inputs (Fig. 2C). To create functional 
circuits, both the promoter architecture and 
the synthetic TFs needed to be optimized 
(fig. S3). We found that the arrangement of 
operators in the OR promoter affected fold 
change, with the best design containing 
alternating pairs of operators (Fig. 2C and fig. 
83). Additionally, synthetic repressors needed 


AmtR Promoter variant 


containing one (1x) to six (6x) AmtR operators co-infiltrated with a plasmid 
encoding proG0190::AmtR-VP16. Control construct (C) is proG0190::GFP. 

(J) Engineered synthetic promoters’ response to AmtR-NLS. In all panels, expression 
jamiana leaves. TFs are expressed using the constitutive 
G1090 promoter. Box plots show the median of 12 leaf punches collected from three 
leaves that were infiltrated and measured on different days. Box plot hinges 
indicate the first and third quartiles. Dots show individual data points. Stars 
denote significant differences in normalized GFP expression and letters denote 
significance groups (P < 0.01, Student's two-tailed t test); n.s., not significant. 
ificantly active transcription factors was calculated by 
dividing the average ON state by the average OFF state. 


to include repressive domains, such as ERF4"”, 
to prevent synthetic activators from initiat- 
ing transcription at composite promoters 
(1-1 state, A NIMPLY B; Fig. 2C and fig. S3). 
However, this requirement could be over- 
come by increasing the number and location 
of repressor operators within the synthetic 
promoter (B NIMPLY A; fig. S3). Most circuits 
involved several design-build-test cycles, which 
were facilitated by the rapid N. benthamiana- 
based assays and the modular nature of the 
synthetic biology parts generated here. Addi- 
tional tuning information is available in figs. 
$3 and S4. 

Layered logic gates in which AmtR and 
PhIF do not directly control expression of GFP, 
but instead modulate the expression of other 
synthetic TFs, occasionally encountered prob- 
lems where expression cassettes were insuffi- 
ciently insulated from each other. The A IMPLY 
B gate, which worked well when its output 
genes were encoded on separate plasmids, had 
an erroneously reduced “no input” state when 
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Fig. 2. Logic gates in N. benthamiana leaves. (A) Schematic for testing 
circuit activity in N. benthamiana using a transient expression assay (see 
supplementary materials). N. benthamiana leaves were infiltrated with 
Agrobacterium strains containing plasmids that encode either one input 
transcription factor (A or B) or the output gene GFP. mCherry was encoded 
on the output plasmid and used to normalize fluorescence measurements. 
(B) Example images of leaves infiltrated with TRUE and FALSE gate 
components. Circles encompass areas of infiltration. (©) Gate behavior in 
N. benthamiana. Bar charts show output of each circuit, reported as the 
ratio of GFP to mCherry. Green bars indicate gate states that should be ON; 
gray bars indicate states that should be OFF to implement correct logic. 
Data are mean and SD of 12 leaf punches collected from three leaves 
infiltrated and measured on different days. Dots show individual data points. 
Data for individual GFP and mCherry channels are provided in the 
supplementary materials. 
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Fig. 3. Patterning gene expression using logic gates. (A) Cell types in the 
Arabidopsis root tip: columella (C), lateral root cap (R), epidermis (E), cortex (X), 
endodermis (N), stele (S), quiescent center (Q). (B) Confocal images show the 
expression pattern of input promoters SOMBRERO (SMB) (input A) and PIN-FORMED 
4 (PIN4) (input B). (C) Confocal images (left) and expected expression patterns 
(right) of logic gates that use the SMB and PIN4 promoters to express input TFs. 
Output of each circuit is nuclear localized GUS-GFP fusion protein. In all panels, 
T-DNA schematics show the identity and arrangement of circuit components. Root 
images were taken 5 days after sowing. Box-and-whisker plots show quantified 
reporter expression for three T2 plants from a single transgenic line. Green boxes 
indicate cell layers that should be ON and gray bars indicate states that should 
be OFF to implement correct logic. Expression levels of individual cells in each 
root layer are shown as black dots. Box plot hinges indicate the first and third 
quartiles. Letters denote significant differences in expression (P < 0.01, Student's 
two-tailed t test). Additional independent lines in fig S6. 


both output genes were encoded on the same 
plasmid (Fig. 2C and fig. S4). In an attempt to 
fix this problem, we added one or two copies 
of the previously characterized insulator 
Arabidopsis thaliana MATRIX ATTACH- 
MENT REGION 10 (MARIO) between the two 
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A IMPLY B output genes (25). However, in- 
sulation only improved gate activity by a factor 
of 2 (fig. S4, version 2 versus versions 3 and 4). 
Consequently, the other IMPLY gate (B IMPLY 
A) was built with two output plasmids that 
were co-delivered to plant cells (Fig. 2C). 


Engineering spatial patterns of 

gene expression 

Functional plant circuits were transferred to 
the model plant A. thaliana to test their capac- 
ity to generate specific spatial patterns of gene 
expression across root tissues. The tissue-specific 


12 AUGUST 2022 * VOL 377 ISSUE 6607 749 


RESEARCH | RESEARCH ARTICLES 


A sir-1 


proGATA23:: BUFFER 2X 


BUFFER 1X BUFFERM3 BUFFER M2 


BUFFER M1 WT 


sir-1 [> 
a. cs | ~ 


| 


Fig. 4. Engineering root branch density using BUFFER gates. (A) Root 
structure (top) and root hairs (bottom) of Arabidopsis plants engineered to 
modify root branch density. BUFFER gate output promoters contain either one 
(1x) or two (2x) copies of the consensus AmtR operator or one copy of a 
mutated AmtR operator (M1 to M3). Scale bar, 1 cm. (B) Lateral root density 
calculated as the number of emerged lateral roots divided by total primary root 


promoters of SOMBRERO (proSMB, expres- 
sed in the entire root cap) and PIN-FORMED 
4 (proPIN4, expressed in columella, root cap, 
and stele) were used to drive expression of 
our input TFs (Fig. 3, A and B) (26, 27). By 
combining the activity of these input pro- 
moters using logic gates, we expected to generate 
several different spatial patterns of gene ex- 
pression (Fig. 3C, cartoons). 

We tested eight different logic gates in 
Arabidopsis, four of which generated the ex- 
pected expression pattern upon first attempt 
(TRUE, FALSE, A BUFFER, NOT A; Fig. 3C). 
Successful gates qualitatively matched the 
expected expression patterns and produced 
a significant difference between GFP expres- 
sion in tissues expected to be ON versus those 
expected to be OFF. By this definition of 
success, TRUE, FALSE, A BUFFER, and NOT 
A gates were successful. For NOT A, the 
difference between the lowest ON (cortex) 
and highest OFF (root cap) states was only 
1.2x; this finding suggests that further opti- 
mization may be necessary for applying the 
circuit in other contexts (Fig. 3C). 

The gates that did not work revealed differ- 
ences in circuit behavior between Arabidopsis 
and N. benthamiana. For example, the first B 
BUFFER gate produced a spatially expanded 
expression pattern relative to the input promoter, 
with aberrant expression in the quiescent center 
(QC) and neighboring initial cells (figs. S5 and 
S6). Similarly, the A NIMPLY B pattern was 
incorrect in Arabidopsis; expression was missing 
in several root cap cell layers (figs. S5 and S6). 

We tuned the B BUFFER, A NIMPLY B, OR, 
and NOR gates to improve performance. For B 
BUFFER gate, we changed the input TF from 
AmtR-VP16 to QacR-VP16, another highly active 
synthetic activator in N. benthamiana (Fig. IC, 
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Fig. 3C, and fig. S5). This change was made to 
eliminate potential interactions between the 
promoter (proPIN4) and the downstream gene. 
For A NIMPLY B, we removed the ERF48? 
from the input TF (PAIF-ERF4). ERF4 is 
believed to recruit histone-modifying enzymes 
and may maintain a repressed state in cells 
that are not actively expressing the repressor 
(23). Successful A NIMPLY B gates were built 
with repressors that lack the repression do- 
main (PAIF-NLS) (Fig. 3C and fig. $5). Tuning 
of the remaining gates (OR and NOR) and a 
description of our troubleshooting process 
are outlined in figs. S5 to S7. 

After tuning, all gates qualitatively matched 
the expected expression patterns (Fig. 3C). How- 
ever, three had expression in a single cell layer 
that was either aberrantly high (B BUFFER, 
endodermis) or low (OR, stele; NOR, QC). Thus, 
quantitative analysis highlights the challenge 
of implementing circuits across cell types in 
heterogeneous tissue and additional optimi- 
zation would be required for these gates to 
achieve significant differences across every 
tissue layer. 


Modifying root structure 


To demonstrate how precise spatial control 
over gene expression may be used to engineer 
development, we modified a key aspect of 
Arabidopsis root architecture: lateral root 
branch density. Lateral roots allow plants to 
radially sample soil, and the number of lateral 
roots that a plant generates affects its ability 
to search for water and essential nutrients in 
the environment (28). The close relationship 
between root growth and plant fitness has 
led to proposals of ideal root architectures 
for plant growth in specific environments 
(called root ideotypes) (29). Ideotype hypothe- 
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length. (C) Primary root length measured from root-hypocotyl junction to the 
root tip. Control samples are labeled as follows: sir-1 (slr), proGATA23::sir (G), 
wild type (WT). In (B) and (C), box plots show the median of at least 20 T1 
plants. Box plot hinges indicate the first and third quartiles. Letters denote 
significant differences in LR density or primary root length (P < 0.01, Student's 
two-tailed t test). All measurements and images were taken 10 days after sowing. 


ses have been difficult to test directly because 
genetic changes that affect root structure often 
have pleiotropic effects. For example, a gain-of- 
function mutation in the developmental reg- 
ulator INDOLE-3-ACETIC ACID INDUCIBLE 
14 (IAAT4) called solitary root (slr-1) eliminates 
root branching, but also hinders root gravit- 
ropism, root hair development, and primary 
root growth (Fig. 4 and fig. $8, slv-1) (30). To 
disentangle root branching from other develop- 
mental processes, we expressed the s/7-7 mutant 
gene using a tissue-specific promoter that is 
only ON in lateral root stem cells, proGATA23”?. 
When expressed from proGATA23, slr-I’s impact 
on development is restricted to lateral roots. In 
these plants, no lateral roots form, but gravit- 
ropism, root hair development, and primary 
root growth are normal (Fig. 4 and fig. S8, 
proGATA23::slr-1). 

We designed BUFFER gates to express 
slr-I at varying levels in lateral root stem cells 
to determine whether lateral root branch de- 
velopment could be quantitatively controlled. 
These BUFFER gates use the GATA23 promoter 
to drive expression of the AmtR-VP16 synthetic 
TF, which then activates a synthetic promoter 
with one, two, four, or six copies of the AmtR 
operator to drive expression of slr-1. Because 
plant transformation randomly inserts trans- 
genes into the genome, a single construct can 
create a range of expression. Although this 
variation potentially alleviates the need for 
promoters of varying strength, none of the 
plants (>20 independent lines per construct) 
containing proGATA23::slr-1 or a BUFFER 
gate with two, four, and six copies of the AmtR 
operator produced lateral roots; this suggests 
that the range of expression conferred by these 
constructs was above the threshold for fully 
blocking lateral root development (Fig. 4, A 
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and B; data not shown for 4x and 6x pro- 
moters). To further reduce the strength of the 
output promoter, we mutated key residues in 
the AmtR binding site (fig. S9) (31. Weaker 
synthetic promoters resulted in BUFFER gates 
that produced a gradient of lateral root den- 
sities, roughly correlated with the strength of 
the synthetic promoter (Fig. 4 and fig. S10). 
None of the BUFFER gates affected primary 
root growth, root hair development, or gravit- 
ropism (Fig. 4, B and C, and figs. S8 and S10). 
Thus, BUFFER gates enabled specific tuning 
of lateral root density. Our results show that 
the difference in regulatory capacity conferred 
by our synthetic system has a greater effect on 
root branching than the T-DNA insertion site, 
but they do not define how a specific dosage of 
slr-1 determines lateral root development; the 
resources generated here can be used to ex- 
plore this relationship further. 

Our work presents a roadmap for the design 
and implementation of synthetic genetic cir- 
cuits that program gene expression across cell 
types in plants. This approach expands the 
impact that a handful of characterized promot- 
ers can have on our ability to express genes 
in specific cells and presents an alternative to 
searching genomes for tissue-specific promoters 
with desired expression patterns. The ability to 
modulate expression in a tissue-specific manner 
provides an opportunity to probe gene dosage 
effects in a tissue-specific manner, which previ- 
ously lacked a reliable framework in plants. As 
circuit technology matures in plants, it could 
be applied to reprogram responses to the en- 
vironment or to better understand gene ex- 
pression control in plants, as has been done 
recently in Drosophila (32, 33). 

Reprogramming crops using synthetic genetic 
circuits will require careful tuning. As evidenced 
here, controlling expression levels across tis- 
sues can be challenging, and although some ap- 
plications may tolerate aberrant expression in a 
few cell layers, others will be extremely sensi- 
tive to off-target expression. To efficiently engi- 
neer genetic circuits that meet specific design 
requirements, researchers can leverage the 
growing number of transient expression methods 
available for plants (34, 35). Advances in single- 
cell measurements and spatially resolved omics 
will also support the construction of circuits in 
multicellular organisms by enabling more precise 
measurements (36, 37). Ultimately, methods for 
programming novel traits in plants will become 
increasingly useful as climate challenges grow and 
new agricultural solutions are needed. 
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Controlled tough bioadhesion mediated by ultrasound 


Zhenwei Ma’, Claire Bourquard’, Qiman Gao*, Shuaibing Jiang’, Tristan De lure-Grimmel*, Ran Huo!, 
Xuan Li’, Zixin He’, Zhen Yang’, Galen Yang®, Yixiang Wang®, Edmond Lam®’, Zu-hua Gao®, 


Outi Supponen*, Jianyu Li2?* 


Tough bioadhesion has important implications in engineering and medicine but remains challenging to 
form and control. We report an ultrasound (US)-mediated strategy to achieve tough bioadhesion 

with controllability and fatigue resistance. Without chemical reaction, the US can amplify the 

adhesion energy and interfacial fatigue threshold between hydrogels and porcine skin by up to 100 and 
10 times. Combined experiments and theoretical modeling suggest that the key mechanism is US- 
induced cavitation, which propels and immobilizes anchoring primers into tissues with mitigated barrier 
effects. Our strategy achieves spatial patterning of tough bioadhesion, on-demand detachment, and 
transdermal drug delivery. This work expands the material repertoire for tough bioadhesion and enables 
bioadhesive technologies with high-level controllability. 


ioadhesive technologies find frequent 
use in wearable electronics, biomedical 
implants, wound management, anasto- 
mosis, regenerative medicine, and drug 
delivery (1-5). However, their use has 
long been hindered by the barrier effects of 
biological tissues, such as low permeability 
and limited functional groups. Skin, for in- 
stance, contains dense stratum corneum, 
limiting the penetration and bonding of bio- 
adhesive agents (Fig. 1A). These issues are 


particularly critical for bioadhesives based on 
physical interactions such as polymer inter- 
penetration; the polymers are too slow and 
even impossible to diffuse and entangle with 
tissues (6), resulting in poor bioadhesion (7). 
Chemical strategies have thus far been pri- 
marily used for tough bioadhesion. Despite 
achieving high adhesion energy, they do not 
enable high-level control over bioadhesion 
in space and time. Exceptions require so- 
phisticated surface patterning, exogenous 
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chemicals, and an external apparatus to re- 
move interfacial bonding (8, 9). Other dis- 
advantages include interference with payloads 
for drug delivery (0), low fatigue threshold 
(caused by limited functional groups on tissue 
surfaces) (17), and acute and/or chronic toxic- 
ity (caused by chemical reactions and their 
reagents) (72). 

To form and control tough bioadhesion, 
our strategy leverages ultrasound (US) and 
anchoring agents such as polymers and nano- 
particles. US has been extensively used in the 
clinical setting for imaging (13), monitoring 
(14), tumor ablation (15), and drug delivery 
(16). The anchoring agents can form a physi- 
cally cross-linked network but do not pene- 
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Fig. 1. Robust and versatile A 
US-mediated tough bioadhesion. 
(A) Schematic of skin with barrier 
effects limiting passive diffusion 
and impairing bioadhesion. (B) US 
actively propels and anchors primer 
agents into a tissue substrate, 
causing spatially confined tough 
adhesion between hydrogel 

and tissue. (C) Representative 
force-displacement curves 

of hydrogel-tissue (porcine skin) 
hybrids with or without US treat- 
ment in peeling tests. (D) US 
enables diverse anchoring agents 
for tough bioadhesion on skin. 

Chi, chitosan; ChsNC, chitosan 
nanocrystal; CHO-CNC, 

aldehyde functionalized cellulose 
nanocrystal. (E) Adhesion 
enhancement with US on diverse 
biological tissues, including skin, 
buccal mucosa, and aorta. Chitosan 
was used as the anchoring primer. 
Data are shown as means + SD for 
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trate tissues by passive diffusion (Fig. 1A) (6). 
We hypothesized that US could propel and 
immobilize the anchoring agents deep into 
tissues, which would form a bridging net- 
work at the interface when a hydrogel patch is 
applied (Fig. 1B). This strategy of US-mediated 
bioadhesion differentiates from existing strat- 
egies based on passive diffusion, chemical 
reactions, or invasive mechanical interlock- 
ing, including those in medical devices such 
as suture anchors and swellable microneedles 
(77) (fig. S1). 

US-mediated bioadhesion was achieved in 
two steps. First, we used an ultrasonic trans- 
ducer (20 kHz) to apply US to a primer solution 
or suspension of anchoring agent spread on a 
tissue substrate (e.g., freshly excised porcine 
skin). Second, we covered the treated area 
with a hydrogel patch, which triggers gelation 
of the anchoring agent at the interface (Fig. 1B). 
As a model system, we deployed a chitosan 
solution and a polyacrylamide-alginate (PAAm- 
alg) hydrogel as the primer solution and the 
hydrogel patch, respectively. We measured the 
adhesion energy between the hydrogel and 
the tissue with peeling tests (Fig. 1C). With 
US treatment (116 W cm~”, 1 min), the ad- 
hesion energy obtained on porcine skin was 
~1750 J m?, >15 times that of the no-US con- 
trol (Fig. 1D). The adhesion energy reached 
~100 J m~? within a minute and then pla- 
teaued in 10 min (fig. S2). The anchoring agent 
is critical for tough bioadhesion because merely 


Ultrasonic activation 


Low adhesion 


Adhesion energy (J m*) 


SO ee es ; 
propelling and anchoring Robust tissue-hydrogel adhesion 


Ultrasound-induced cavitation 


replacing it with phosphate-buffered saline 
yielded weak bioadhesion, even with US treat- 
ment (fig. S3). The adhesion performance was 
dependent on the pH difference between the 
hydrogel patch and the primer solution (fig. $4) 
but independent of blood exposure on tissue 
substrates (fig. S5). We also used an ultrasonic 
scaler (20 to 35 kHz) like the type used in dental 
clinics to obtain tough adhesion (~800 J m) 
on porcine skin with US treatment (fig. S6). 
Our strategy is applicable to a large rep- 
ertoire of materials. The adhesion enhance- 
ment by US was confirmed with different 
hydrogels, including a double-network poly(N- 
isopropylacrylamide)-alginate (PNIPAm-alg) 
hydrogel and a single-network PAAm hy- 
drogel; the variance of adhesion energy can 
be linked with the bulk toughness of the hy- 
drogel (fig. S7). We also demonstrated US- 
mediated bioadhesion with various anchoring 
agents, including gelatin, ChsNC, and CHO- 
CNC (Fig. 1D), and the different adhesion 
performances could be related to the charge 
density and structure of anchoring agents. 
Gelatin could lead to >200 J m™® of adhesion 
energy with US treatment, markedly higher 
than that without US (~26 J m™°). CHO-CNC 
yielded high adhesion energy (~180 J m~”) 
with US treatment, whereas the condition 
without US yielded weak adhesion (~12 J m~). 
ChsNC could not penetrate deep into skin 
without the aid of US (fig. S8). The adhesion 
of ChsNC without US remained weak even 
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when they formed amide bonds with tissues 
(fig. S9). When the US was applied, the ad- 
hesion energy increased to 500 J m™, two 
orders of magnitude higher than the same 
material without US treatment (~5 J m7). 

In addition to the skin, our strategy is appli- 
cable to other biological tissues, including 
buccal mucosa and aorta (Fig. 1E). The mea- 
sured adhesion energy was ~295 J m~ for 
buccal mucosa and ~297 J m” for aorta; the 
values for no-US conditions were ~12 and 
~17 J m~’, respectively. Tough bioadhesion 
was evidenced by the debonding of buccal 
membrane from the underlying tissues during 
peeling (fig. S10), indicative of strong adhesion. 


The tissue-dependent adhesion performance 
can be related to the mechanics and chemistry 
of specific tissues (78). The versatility of US- 
mediated bioadhesion and its indispensable 
role in nanoparticle bioadhesion unlock the 
potential of various materials for tough bio- 
adhesion (fig. S11) (79). 

We investigated the mechanism underlying 
US-mediated bioadhesion as follows. The US 
exerts mainly thermal and mechanical effects 
on the primer solution and the tissue sub- 
strate. Given the US intensity used in this 
study, we detected minimal change of the US- 
treated chitosan in terms of structure and 
gelling behavior (fig. S12) (20). To evaluate the 


thermal effect, we monitored the temper- 
ature change using a thermal camera. After 
a 1-min application of 16 W cm? US, the sur- 
face temperature of porcine skin increased 
by ~10°C (fig. S13). To determine the effects of 
this, we replicated the temperature change 
with a temperature-controlled oven to incu- 
bate hydrogel-primer-tissue hybrids. Because 
no significant change of bioadhesion was found 
(fig. S14), we excluded the thermal effect. We 
then focused on the mechanical effects of US 
such as cavitation, viscous stress, and acoustic 
radiation force. Of these, cavitation was hypothe- 
sized to play a key role (27) because viscous 
stress and acoustic radiation force are less 
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Fig. 2. US-induced cavitation regulates bioadhesion. (A) Experimental 
setup for characterizing US-induced microbubble cavitation. (B) Digital 
images of the microbubble cloud at peak intensity in a cycle captured 

by the high-speed camera. Scale bar, 100 um. (C) Processed binary images 
of the bubble clouds. (D) Normalized peak intensity of the induced 
microbubble clouds as a function of the US intensity. (E) Representative 
force-displacement curves of hydrogel-tissue hybrids in lap-shear tests. 

(F and G) Correlation between applied US intensity and interfacial adhesion 
energy (F) and shear strength (G). (H) Correlation between peak intensity of 
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bubble clouds, interfacial adhesion energy, and shear strength. (1) Schematic 
illustration of the fatigue test of hydrogel-tissue hybrid. Data are shown as 
means + SD for n = 3 independent experiments. (J) Representative curves of 

the cycle number and maximum displacement per cycle at two energy release rates 
for samples with US treatment. (K) Crack extension rate (dc/dN) versus applied 
energy release rate G=F/W for hydrogel adhesion on tissues after no US or 
exposure to low (16 W cm™)- or high (116 W cm”)-intensity US. The linear 
extrapolation to the G-axis (solid lines) gives the fatigue threshold To. (L) Correlation 
of US intensity and obtained fatigue threshold To. 
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Fig. 3. Spatial control of US-mediated bioadhesion. (A) Schematics of US 
application at varying distances d between the US horn and the tissue substrate. 
(B) Correlation of d and the adhesion energy characterized by lap-shear tests. 
(C) Images of porcine skin where a clear boundary between tough adhesion 
and nonadhesion regions can be observed. Scale bar, 1 cm. (D) Correlation of d 
and tough adhesion area measured experimentally (green dot) and computed 


significant for the low-frequency US used here 
(see the supplementary text and fig. S15). To 
test this hypothesis, we combined experiments 
and theoretical modeling to study the link be- 
tween cavitation and bioadhesion. 

We first characterized cavitation and bio- 
adhesion as a function of US intensity. The 
cavitation was manifested with dynamic micro- 
bubble clouds present under the US transducer 
(Fig. 2, A and B). A high-speed camera revealed 
the pattern and geometry of oscillating vapor 
bubble clouds. The peak cavitation intensity 
was quantified with the maximum area of 
microbubble cloud in each cycle (Fig. 2C). The 
US substantially enhanced the peak intensity 
of cavitation, but the effect diminished when 
the intensity was high (Fig. 2D). A similar 
trend in interfacial adhesion energy was ob- 
tained in modified lap-shear tests (Fig. 2E). 
The adhesion energy and shear strength be- 
tween hydrogel and porcine skin increased 
with the US intensity (Fig. 2, F and G), and 
both were shown to correlate linearly with the 
peak bubble intensity (Fig. 2H). The adhesion 
energy scaled almost linearly with the dura- 
tion of US treatment (fig. S16), further corrob- 
orating the correlation between cavitation and 
bioadhesion. 

This correlation can be explained as follows. 
When the bubbles collapse and jet, the an- 
choring agents are propelled and anchored 
into tissue substrates, enabling interfacial 
bonding with the hydrogel patch. In support 
of this point, we visualized the single-bubble 
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dynamics with laser-induced cavitation ex- 
periments, showing that the laser-induced 
bubble collapses and jets toward the tissue 
substrate (fig. S17 and movie S1); US-induced 
microbubble jetting was suggested by pit 
generation in aluminum foil (fig. S18). There 
might also be contributions from other me- 
chanical effects of the US such as viscous 
stress, which requires further investigation. 
To further study the US effect on interfacial 
bonding, we performed fatigue fracture tests. 
Such tests output interfacial fatigue threshold 
or intrinsic work of adhesion T9, which scales 
with the density of interfacial bonds according 
to the Lake-Thomas theory (/8). In fatigue tests 
under 180° peeling configuration (Fig. 21), we 
varied the magnitudes of cyclic loads applied 
onto the specimens for different energy release 
rates (G) and monitored the crack extension 
over cycles (de/dN) (Fig. 2J). The crack growth 
rate increased with the loading G (Fig. 2K). 
The linear regression of the G-dc/dN curves 
informed the intrinsic work of adhesion (1). 
The US treatment raised Fy from ~5 J m~ 
(no-US) to ~65 J m~? (Fig. 2L). With the ab- 
sence of covalent bonding at the interface, the 
high Tp exceeded the case of forming inter- 
facial amide bonds through carbodiimide 
chemistry (~25 J m~) (11). These results sub- 
stantiate the existence of strong interfacial 
interactions, which are typically only observed 
with covalent bonding, in contrast to the often 
weak physical interactions resulting from in- 
terdiffusion, such as entanglement. 


Radial position (mm) 


Radial position (mm) 


theoretically (blue circle). Data are shown as means + SD for n = 3 independent 
experiments. (E) Side view of theoretically computed fields. The area where 

p < py during part of the acoustic cycle is shown by a red dashed line for d = 1, 3, 
and 5 mm. (F) Corresponding top view at substrate level for d = 1, 3, and 5 mm. 
Red arrowhead indicates the boundary of tough adhesion area. Blue areas are 
where total pressure p goes below vapor pressure py according to theory. 


We next showed that the US treatment me- 
diates tough bioadhesion. Because the US 
effects scaled with the distance between the 
transducer and the tissue (d), simply maneuver- 
ing the US transducer could control the bio- 
adhesion in magnitude and space (Fig. 3A). 
The adhesion energy decreased with increasing 
d and eventually to the level without US when 
d>4mm (Fig. 3B). The distance d also mediated 
the area of bioadhesion (Fig. 3C). The tough 
adhesion area decreased with the gap be- 
tween the transducer and the tissue (Fig. 3D). 
To understand and predict the spatially con- 
trolled bioadhesion, we conducted theoretical 
modeling on the acoustic field produced by 
the US transducer between the horn and the 
substrate (see the supplementary text); the 
experimentally measured displacement of 
the US transducer was used in the model 
(fig. S19). We extracted the area on the sub- 
strate where the absolute pressure dropped 
below the vapor pressure in every acoustic 
cycle, thereby enabling the formation of cav- 
itation bubbles (Fig. 3, E and F). At various 
values of d, we obtained substantially different 
pressure profiles on the tissue substrate from 
which the regions affected by cavitation were 
estimated (Fig. 3D). The computational results 
agreed with the experimental measurements. 
The spatial resolution of bioadhesion could be 
further improved with focused-US technology 
(22). In addition to demonstrating spatial con- 
trol, we demonstrated that US could enable 
temporal control over bioadhesion by using 
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Fig. 4. In vivo assessments. (A) Schematics of the procedures of US-mediated 
bioadhesion with a rodent model. (B) Spatial control of bioadhesion on rat skin. 
Blue dotted circle indicates the adhesion region. Scale bar, 1 cm. (©) Adhesion 
energy between rat skin and hydrogel without or with US treatment (16 W crm’). 
Data are shown as means + SD for n = 3 independent experiments. (D and 

E) Representative hematoxylin and eosin-stained images of the hydrogel-rat skin 


the heating effect of US and a thermo-gelling 
gelatin as the anchoring primer (see the sup- 
plementary text and fig. S20). The controlla- 
bility is particularly desired because current 
bioadhesives are limited by homogeneous 
adhesiveness, poor control over the diffusion 
of adhesive agents, and complications asso- 
ciated with patterning adhesives (2, 8, 9). 

Finally, we validated the safety and efficacy 
of US-mediated bioadhesion with a rodent 
model in vivo (Fig. 4A). This animal study is 
critical because the mechanical index for the 
high-frequency US safety criteria for imag- 
ing applications is not applicable to the low- 
frequency US used in this study (16). We tested 
both chitosan and ChsNCs to demonstrate 
in vivo applicability. Tough bioadhesion 
formed within minutes selectively on a cir- 
cular region treated with US, indicative of 
spatially controlled adhesion (Fig. 4B). By 
testing freshly excised rat skin, we confirmed 
the high adhesion energy achieved with US 
in vivo (Fig. 4C). Histological assessments 
concluded that there was no marked tissue 
damage or acute inflammation caused by US 
(16 W cm”) or bioadhesives (Fig. 4, D to F). 
Rodent skin is more sensitive to the heating 
of high-intensity US compared with porcine 
skin (see the supplementary text and fig. S21). 
Future study is needed to establish the US 
conditions (e.g., intensity and profile) for clin- 
ical translation. 
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Hydrogel application 


Here, we report US-mediated bioadhesion 
to precisely control hydrogel bioadhesion in 
space and time. Along with drug-eluting hy- 
drogels, our strategy enables tough bioadhesion 
and transdermal delivery of protein concur- 
rently (see the supplementary text and fig. S22). 
The universal applicability of our strategy 
promises impacts in broad areas ranging from 
wearable devices to drug delivery. 
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severe inflammation. Statistical significance and P values were determined 
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Electron in a cube: Synthesis and characterization of 
perfluorocubane as an electron acceptor 


Masafumi Sugiyama’, Midori Akiyama‘, Yuki Yonezawa’, Kenji Komaguchi2, Masahiro Higashi’, 


Kyoko Nozaki‘, Takashi Okazoe"* 


Fluorinated analogs of polyhedral hydrocarbons have been predicted to localize an electron within 
their cages upon reduction. Here, we report the synthesis and characterization of perfluorocubane, 

a stable polyhedral fluorocarbon. The key to the successful synthesis was the efficient introduction of 
multiple fluorine atoms to cubane by liquid-phase reaction with fluorine gas. The solid-state structure of 
perfluorocubane was confirmed using x-ray crystallography, and its electron-accepting character was 
corroborated electrochemically and spectroscopically. The radical anion of perfluorocubane was 
examined by matrix-isolation electron spin resonance spectroscopy, which revealed that the unpaired 
electron accepted by perfluorocubane is located predominantly inside the cage. 


or organic chemists, polyhedral mole- 
cules such as cubane (7), dodecahedrane 
(2), and buckminsterfullerene (3) repre- 
sent attractive synthetic targets beyond 
their arguably subjective elegance. After 
the syntheses and structural characterizations 
of these polyhedral cages were achieved, inter- 
est quickly shifted to the possibility of encap- 
sulating guests within their internal cavities. 
Early examples included the encapsulation of 
a variety of atoms in fullerene (4) and dode- 
cahedrane (5) as well as the encapsulation of 
a molecule in fullerene (6, 7). Meanwhile, 
polyhedral fluorocarbons (CF),,—i.e., the per- 
fluorinated analogs of polyhedranes—have 


become a focal point of attention and the sub- 
ject of numerous theoretical studies (Fig. 1A). 
One of their most interesting features is their 
electron-accepting character inside their cage, 
which arises from a stabilized vacant orbital 
within the cage derived from multiple o* or- 
bitals of C-F bonds (8-10). This internal local- 
ization of electrons stands in stark contrast 
to common z-conjugated electron acceptors, 
which usually host electrons on their molecu- 
lar surfaces. Although some experimental evi- 
dence that supports the formation of (CF), 
(where 7 = 20 or 60) has been reported (11, 12), 
isolation as a single isomer has not yet been 


achieved because of the difficulties associated 


with the exhaustive fluorination of all cage 
vertexes and the latent instability of (CF), on 
account of the overcrowded fluorine atoms 
(13-15). By contrast, perfluorocubane [(CF)s, 
1], whose structure was proposed in 2004 (16), 
should be sufficiently stable for isolation given 
that the vicinal fluorine atoms are sterically 
less hindered than those in larger polyhedranes. 
Moreover, a theoretical study by Irikura (8) 
and our own density functional theory (DFT) 
calculations have suggested that 1 can be ex- 
pected to exhibit pronounced electron-accepting 
character [electron affinity = 1.6 eV (8); energy 
of the lowest unoccupied molecular orbital 
(E,umo) = —2.8 eV; Fig. 1B]. Here, we report 
the synthesis, isolation, and characterization 
of 1 as a polyhedral fluorocarbon that can 
accept and store an electron within its internal 
cubic cavity (i-e., electron in a cube). 

As a synthetic route to 1, the stepwise intro- 
duction of eight F atoms onto cubane is im- 
practical; in fact, only two F atoms have been 
successfully introduced into a cubane scaffold 
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Fig. 1. A perfluorinated cube-shaped molecule accepts an electron inside its cage. (A) Schematic representation of the encapsulation of electrons, atoms, or 
molecules inside cage-shaped molecules. Polyhedral fluorocarbons can accept an electron inside their cage, whereas common z-conjugated electron acceptors usually 
store electrons on their surface. (B) Structure of perfluorocubane (1) and its predicted properties. The energy level of 1 was calculated at the B3LYP/6-311++G(d,p) 
level of theory, and the predicted electron affinity has been reported previously (8). (©) Previously reported related molecules and synthetic strategy for this study. 
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Fig. 2. Synthetic routes to A 
fluorinated cubanes. (A) Prepa- 
ration of heptafluorinated com- 
pound 5 and octafluorinated 1. The 
yield for 3 in the direct fluorination 
represents an average value 

of six experiments, and the 

error (to) was obtained from the 
standard deviation (o). THF, 
tetrahydrofuran; LiHMDS, lithium 
bis(trimethylsilyl)amide; NFSI, 4 
N-fluorobenzenesulfonimide; 

Bn, benzyl. (B) Preparation of 

hexafluorinated compound 7. 


ce ye ao 2% m , Sy 


so far (Fig. 1C) (77, 18). Several per-substituted 
cubanes have been synthesized through the 
[2+2] photocycloaddition of the correspond- 
ing octa-substituted syn-tricyclo[4.2.0.07”Jocta- 
3,7-dienes (19, 20). Although it is feasible 
to conceive the synthesis of 1 from syn- 
octafluorotricycloocta-3,7-diene, its conversion 
to 1 has not yet been reported (21, 22). Against 
this background, the introduction of multiple 
fluorine atoms through radical C-H fluorina- 
tion using fluorine gas might seem the most 
promising strategy to synthesize 1. However, a 
reported attempt to fluorinate cubane using 
fluorine gas resulted in ring opening owing to 
the highly strained nature of the cubic skele- 
ton (23). We envisaged that the C-H fluori- 
nation of cubane could be achieved using a 
modified version of the liquid-phase direct 
fluorination designated the “PERFECT” meth- 
od, in which the direct fluorination of a par- 
tially fluorinated ester with an excess of fluorine 
gas in a fluorous solvent affords the corre- 
sponding perfluorinated product with marked 
suppression of C-C bond cleavage (24, 25). 
This synthetic approach enabled the system- 
atic preparation of octa-, hepta-, and hexa- 
fluorocubanes, which enabled study of the 
effect of the degree of fluorination of cubane 
on its electron affinity. 

The synthetic route to 1 is shown in Fig. 2A. 
A solution of cubanemonoester 2 in 1,2-dichloro- 
3-(2-chloro-1,1,2,2-tetrafluoroethoxy)-1,1,2,3,3- 
pentafluoropropane (CFE-419) was treated 
at -20°C with an excess of fluorine gas. After 
the reaction, three new peaks were observed 
in the '°F nuclear magnetic resonance (NMR) 
spectrum with an area ratio of 3:3:1, which 
indicated the formation of the heptafluoro- 
cubane monoester (8) with an average NUR 
yield of 15% (for details, see the supplementary 
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Fig. 3. Crystal structures of fluorocubanes. (A) Thermal ellipsoid plots of 1, 4, and 7 (ellipsoid probability, 
50%). (B) Packing structure of 1 in the unit cell. Two molecules in the unit cell are depicted using a space-filling 
model to show the manner of interaction (carbon, white; fluorine, yellow). (C) Hirshfeld surface for dimers 

of 1 in the crystal structure mapped using dorm (left), together with a hypothetical illustration of the 
interactions in the dimers (right). (D and E) Schematic illustration of noncovalent o-hole interactions. 
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Fig. 4. Properties of fluorocubanes. (A) UV-vis 
absorption spectra of 1 (red), 5 (blue), and 7 
(gray) in MeCN; the vertical dashed lines 
represent the theoretically predicted absorption 
coefficients and wavelengths calculated at the 
B3LYP/6-311++G(d,p) level of theory. The 

red asterisk indicates the predicted absorption 
wavelength of 1, for which the predicted oscillator 
strength f is 0.00 given that it is symmetry 
forbidden. (B) LUMOs and the energy levels of 
frontier orbitals for 1, 5, and 7, calculated 

at the B3LYP/6-311++G(d,p) level of theory. 

Red and green areas depict different phases of the 
plotted orbitals. (©) ESR spectrum of lina 
hexamethylethane matrix after y-irradiation at 

77 K (black) and simulated ESR spectrum (red) 
using g = 1.9985 and a(!9F) = 19.62 mT (8F). 

Exp, experimental; Sim, simulated. (D) Spin 
density (contour = 0.005; top) and singly occupied 
molecular orbital (SOMO) (isovalue = 0.05; 
bottom) in the radical anion of 1, calculated at 
the UB3LYP/6-311++G(d,p) level of theory. 


materials). Ester 3 was subsequently subjected 
to transesterification without isolation, and 
benzyl heptafluorocubanecarboxylate (4) was 
successfully isolated. Upon hydrolysis of 4, a 
subsequent decarboxylation furnished hepta- 
fluorocubane (5; 76% yield). The decarboxyla- 
tion of perfluorinated tertiary carboxylic acids 
is known to occur without requiring any other 
reagents (26). The reaction of 5 with lithium 
bis(trimethylsilyl)amide (LiIHMDS) and N- 
fluorobenzenesulfonimide (NFSI) gave 1in 
51% yield. The ‘°F and the ’°F-decoupled °C 
NMR spectra of 1 in acetone-dg each feature 
one singlet peak ['°F: -197.19 parts per million 
(ppm); ’C{'°F}: 103.8 ppm], and a cross peak 
between these two peaks was observed in the 
°F.18C heteronuclear multiple quantum co- 
herence (HMQC) spectrum. The Fourier trans- 
form infrared (FTIR) spectrum in the range 
400 to 4000 cm’ contained only two peaks 
at 1371 and 798 cm’, which reflects the high 
symmetry (O, point group) of 1. In the mi- 
croscopic Raman spectrum, the characteristic 
peak corresponding to the breathing vibration 
mode of 1 was observed at 552 cm’. Hexa- 
fluorocubane (7) was also synthesized from 
cubanediester 6 (32%, in three steps) using a 
similar procedure (Fig. 2B and the supplemen- 
tary materials). 
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The structures of fluorocubanes 1, 4, and 7 
were unambiguously determined using single- 
crystal x-ray diffraction analysis (Fig. 3A). The 
identical lengths (1.570 A) of the 12 C-C bonds 
in 1 clearly indicate a nondistorted cubic 
structure in the solid state, and this length was 
almost identical to a previously reported the- 
oretical value that was obtained from high- 
level DFT calculations (27). The bond length 
was also almost identical to that of the parent 
cubane CgHg (1.572 A) (28). According to Bent’s 
rule, atoms tend to direct the bonding of 
hybrid orbitals of greater p character toward 
electronegative substituents, such as fluorine. 
As a result, the s character of the C-C bond is 
increased, and the bond is shortened. By con- 
trast, the average C-C bond expands with in- 
creasing numbers of fluorine atoms, which 
is the result of the repulsion between vicinal 
fluorine atoms (29). These two opposing effects 
appear to cancel each other out, and hence 
1 shows a C-C bond length that is similar to 
that of the parent cubane (supplementary 
materials). In the crystal structure, 1 exhibits a 
characteristic intermolecular interaction. One 
fluorine atom of 1 interacts with the center 
of the cyclobutane ring of the nearest mole- 
cule, as depicted using a space-filling model in 
Fig. 3B; this interaction is also clearly observed 


ow 


o"'9 


SOMO (17) 


in the Hirshfeld surface analysis (Fig. 3C) (30). 
The interaction energy was estimated to be 
3.5 kcal/mol using natural energy decompo- 
sition analysis (31). This type of interaction 
has recently been recognized as a noncovalent 
carbon-bonding interaction, which is a subset 
of tetrel bonding. Specifically, carbon bonding 
represents an interaction between a Lewis- 
acidic o-hole centered on a carbon atom and 
a Lewis base, akin to halogen bonds (Fig. 3D) 
(32). The only reported example of a synthe- 
sized carbon-bonding donor is tetracyanocy- 
clopropane (33). Bauza et al. have predicted 
that 1 can be expected to act as a strong carbon- 
bonding donor that interacts with anions such 
as cyanide (Fig. 3E) (34), and the packing 
structure of 1 in the single crystal obtained 
in the present study strongly supports this 
prediction. 

Fluorocubanes 1, 5, and 7 showed ultraviolet- 
visible (UV-vis) absorption peaks at wavelengths 
longer than 160 nm owing to their lower-lying 
LUMOs compared with those of other fluoro- 
alkanes, which are transparent at 160 nm (35). 
As shown in Fig. 4A, 1 exhibits the longest 
absorption wavelength edge among the three 
fluorocubanes, and the edges of 5 and 7 shift 
hypsochromically with decreasing numbers of 
fluorine atoms. DFT calculations at the BBLYP/ 
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6-311++G(d,p) level of theory revealed that the 
LUMO of each fluorocubane comprises the o* 
orbitals of the C-F bonds and that the energy 
of the LUMO decreases with increasing num- 
bers of fluorine atoms (Fig. 4B). The results of 
time-dependent DFT calculations suggest that 
the absorptions of 5 and 7 should most likely 
be attributed to the transition from the doubly 
degenerate the second-highest occupied molec- 
ular orbital (HOMO-1), which consists of the 
C-C o bonds of the cubane scaffold, to the 
LUMO. The bathochromic shift of the absorp- 
tion peak of 5 (231 nm) compared with that of 7 
(200 nm) is the result of the lower LUMO level 
of 5 relative to that of 7 and consistent with the 
calculated values (5: 247 nm; 7: 211 nm). 
Perfluorocubane (1) has a triply degenerate 
HOMO because of its highly symmetric struc- 
ture, and the predicted oscillator strength of 
the HOMO-LUMO transition is 0.00 (Fig. 4B, 
red asterisk). This can be rationalized in terms 
of the point group of 1 (O;,), which renders the 
transition from the HOMO (t,,,) to the LUMO 
(aig) Symmetry forbidden (16). In the experi- 
mental UV-vis absorption spectrum of 1, a 
weak absorption was observed at a wave- 
length similar to the calculated HOMO-LUMO 
transition (278 nm), which can be caused by 
vibronic coupling (fig. S11 and the supple- 
mentary materials). 

The reduction potentials of 1 and 5 (1: -2.1 V; 
5: -2.5 V) were experimentally determined 
using differential pulse voltammetry (DPV) 
measurements (fig. S14). The LUMO levels of 
1and 5 were estimated to be —-2.7 and —2.4 eV, 
respectively, corrected by comparison with 
the HOMO level of ferrocene (-4.8 eV relative 
to the vacuum level). Both values are in good 
agreement with the corresponding calculated 
values (1: -2.8 V eV; 5: -2.2 eV). In cyclic 
voltammetry (CV) measurements, both 1 and 
5 showed an irreversible reduction wave in 
the range of —2.1 to -2.9 V (fig. S15), which 
likely indicates that 1 and 5 accept an elec- 
tron and that the resulting radical anions are 
unstable under the applied conditions and 
decompose. 

To confirm the structure of the radical anion 
generated upon accepting an electron into the 
LUMO of I, derived from the eight C-F o* or- 
bitals, we used matrix-isolation electron spin 
resonance (ESR) spectroscopy in combination 
with y-ray radiolysis at low temperatures. This 
technique has already been used to observe un- 
stable radical anions of perfluorocycloalkanes 
(36). As shown in Fig. 4C, the ESR spectrum 
of 1°" in a hexamethylethane (HME) matrix at 
77 K is well resolved. Although the innermost 
peaks overlap with the intense signals of the 
HME matrix radicals in the congested central 
region, the other eight peak bands were ob- 
served in accordance with the total nuclear 
spin quantum number, Mj = +4, +3, +2, and 
+1. Because of the large a(’°F) hyperfine 
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coupling constant (hfcc), the theoretically 
predicted hyperfine structure characteristic 
of second-order splitting could be confirmed 
(37). The isotropic line shape was well sim- 
ulated by assuming identical coupling of the 
eight equivalent °F nuclei. This result suggests 
that the observed radical anion of 1 rotates rap- 
idly even in the glass-state matrix at 77 K. The 
hfcc (19.62 mT) is in accord with the value 
based on the calculated structure (19.24 mT), 
in which the spin density is mainly distributed 
inside the cubane cage (Fig. 4D). 

Previous polyhedral molecules have found 
widespread applications in functional mate- 
rials after their synthesis. Thus, our study, 
which demonstrates that perfluorinated cage 
compounds act as electron acceptors, can be 
expected to pave the way for the molecular 
design of distinctive functional organic ma- 
terials [a connection to the environmental 
accumulation of per- and polyfluoroalkyl sub- 
stances (PFAS) cannot be discarded at present, 
and attention should be paid to this issue 
when devising applications of such fluorinated 
cubanes] (38). 
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SPEECH EVOLUTION 


Evolutionary loss of complexity in human vocal 
anatomy as an adaptation for speech 
Takeshi Nishimura’*, Isao T. Tokuda’, Shigehiro Miyachi??, Jacob C. Dunn*>°, Christian T. Herbst”°, 
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Hirohiko Imai’, Tetsuya Matsuda’, Ole Nzesbye Larsen®, Uwe Jiirgens®, Hideki Hirabayashi’°, 


Shozo Kojima’, W. Tecumseh Fitch®* 


Human speech production obeys the same acoustic principles as vocal production in other animals 
but has distinctive features: A stable vocal source is filtered by rapidly changing formant frequencies. 
To understand speech evolution, we examined a wide range of primates, combining observations of 
phonation with mathematical modeling. We found that source stability relies upon simplifications in 
laryngeal anatomy, specifically the loss of air sacs and vocal membranes. We conclude that the 
evolutionary loss of vocal membranes allows human speech to mostly avoid the spontaneous nonlinear 
phenomena and acoustic chaos common in other primate vocalizations. This loss allows our larynx to 
produce stable, harmonic-rich phonation, ideally highlighting formant changes that convey most 
phonetic information. Paradoxically, the increased complexity of human spoken language thus followed 


simplification of our laryngeal anatomy. 


peech is the dominant mode of human 

linguistic expression, and for most of 

our evolutionary history, until the emer- 

gence of signed and written languages, 

speech provided the sole communicative 
modality for language. Speech production is 
based on the same acoustic and physiological 
principles as vocal production in other terres- 
trial vertebrates (J-4) but nonetheless pos- 
sesses distinctive attributes. First, our vocal 
source (produced by laryngeal vocal fold oscil- 
lations modulating air flow from the lungs; 
Fig. 1, A and B) is uncharacteristically stable, 
and nonpathological adult speech completely 
avoids the nonlinear phenomena and bifurca- 
tions to chaos—i.e., irregular oscillations and 
abrupt frequency transitions—commonly seen 
in most other mammals (5, 6). This stability, 
combined with enhanced neural control of 
laryngeal muscles (7), yields a highly reliable 
fundamental frequency ({, or “pitch”) and rich 
array of harmonics (5, 8, 9). The predictable, 
broadband acoustic energy in this source sig- 
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nal provides the backdrop upon which rapid 
modulations of our vocal tract filter yield a 
dynamic pattern of formant frequencies con- 
veying phonetic information. Together, under 
fine neural control, this human source-filter 
system provides the high-bandwidth commu- 
nicative signal needed to rapidly encode com- 
plex linguistic information. 

Speech-related specializations of the human 
vocal tract, including the descent of the tongue 
root into the pharynx, are well documented 
(10-12), and their acoustic effects well under- 
stood (5, 8-11), but evolutionary changes in 
our larynx have been relatively neglected. First, 
humans lost the laryngeal air sacs seen in 
other great apes (6, 13), and which were prob- 
ably still present in Australopithecus (14). Sec- 
ond, we show here that vocal membranes (also 
known as “vocal lips”)—thin upward projec- 
tions of the vocal folds (Fig. 1C)—are typical 
laryngeal features in primates but were lost 
in humans (Fig. 1B). Using a combination of 
multidisciplinary methods, we show that vocal 
membranes increase nonlinearities, yielding 
vocal instability. This leads to the surprising 
conclusion that the increased stability of hu- 
man phonation results from an evolutionary 
loss of anatomical complexity. Although fossil 
indicators of vocal fold anatomy are unavail- 
able, our comparative data indicate that this 
simplification through loss must have occurred 
since our divergence from chimpanzees roughly 
6 million years ago. 

We show that a vocal membrane is a key 
anatomical feature shared by all nonhuman 
anthropoid primates (“primates” hereafter), in- 
cluding hominoids or apes, cercopithecids 
or Old World monkeys, and platyrrhines or 
New World monkeys (see Fig. 1D for phyletic 
relationships). We used magnetic resonance 
imaging (MRI) and computed tomography 


(CT) to examine larynges from 25 genera and 
43 species of primates (table S1). All non- 
human species possess a vocal membrane 
and often exhibit a shallow sulcus separating 
the vocal membrane from the vocal fold (Fig. 
1C). Platyrrhines usually have a tall vocal 
membrane (Fig. 1, D to G, and fig. S1, A to 
G), whereas cercopithecids have a shorter lip- 
like membrane (Fig. 1, D and H to J, and fig. 
S1, H to L). This feature can vary even within a 
single species in hominoids (Fig. 1, C, D, and 
K to M; and fig. SI, M to P). In the gibbons 
and siamang (hylobatids), the vocal membrane 
extends from the lateral wall of the laryngeal 
cavity and has become disconnected from the 
vocal fold (Fig. 1K and fig. S1, M to O). 

Given these anatomical data, the most par- 
simonious evolutionary conclusion is that the 
vocal membrane is an ancestral primate fea- 
ture, and that this feature was lost in the hu- 
man lineage to yield the rounded vocal fold 
typical of humans (Fig. 1B). Thus, we argue that 
the absence of a vocal membrane in the human 
larynx is an evolutionarily derived feature. 

Turning to function, the anatomical features 
of the vocal membrane suggest that it plays a 
role in phonation. When the glottis (the air 
space between the vocal folds) is closed for 
phonation by adducting the arytenoid carti- 
lages, both the vocal folds and membranes 
move toward the glottal midline, implying 
that the vocal membranes should also vibrate 
during phonation in primates. 

Supporting this hypothesis, vocal membrane 
vibration was documented in vivo during pho- 
nation in a chimpanzee, the closest phylo- 
genetic relative of humans. Hayama et al. 
examined reflex glottal closure using a trans- 
nasal fiberscope in an adult male chimpanzee 
under anesthesia (75). Serendipitously, these 
video data also documented phonation during 
375 grunts or growls as the chimpanzee was 
awakening from anesthesia. We reexamined 
these data and found that the vocal mem- 
branes always vibrate during vocalization, 
colliding to close the glottis (Fig. 2A and 
movie S1). These observations suggest that, 
in chimpanzees, the vocal membrane plays a 
central role in phonation even during low- 
frequency calls and is not simply an accessory 
feature of the vocal folds subserving high-pitch 
vocalization as previously thought (5, 16, 17). 

Because such in vivo evidence is very chal- 
lenging to obtain in chimpanzees, we next 
quantitatively investigated the role of vocal 
membranes in an ex vivo setting, using ex- 
cised larynges from three chimpanzees. Each 
larynx was mounted on a vertical tube supplying 
airflow, and the adductory gestures observed 
in vivo were reproduced by positioning the 
arytenoid cartilages with adjustable prongs. 
Adductory conditions were further modified 
by pulling the thyroid cartilage antero-inferiorly. 
Vocal membrane and fold vibrations driven by 
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varying air pressure were documented by using 
high-speed video, and phonatory dynamics 
were simultaneously documented by using 
time-synchronized acoustic and electroglot- 
tographic recordings. 

The vocal membranes always participated in 
vibration in these chimpanzee ex vivo experi- 
ments (Fig. 2, B to G, and movie 82, A to C) and 
oscillated in a wave-like fashion in the absence 
of vocal fold vibration (Fig. 2, B and C, and 
movie S2A). The vocal membranes also vi- 
brated separately from—but in phase with—the 
vocal folds (Fig. 2, D and E, and movie S2B). In 
such cases, the vocal membranes usually col- 
lided, but the vocal folds rarely did so. Only 
when very firmly adducting and lowering 
the arytenoid vocal process did we observe 
vocal fold collision resembling that of humans. 
In such cases, there was sometimes a phase- 
delayed oscillatory pattern, in which the vocal 
folds first collided and the vocal membranes 
followed (Fig. 2, F and G, and movie S2C). 
Thus, adducting the arytenoid always resulted 
in vocal membrane vibration and often colli- 
sion. By contrast, vibrations of the vocal folds 
alone, without the vocal membrane, were never 
observed in these experiments. In sharp con- 
trast, vocal fold vibrations always play a central 
role in phonation in human speech. 

Further in vivo experiments with two rhesus 
macaques (cercopithecids) and two squirrel 
monkeys (platyrrhines) corroborated the ob- 
servations in chimpanzees. We successfully 
induced vocalization by electrical stimulation 
to the periaqueductal gray and surrounding 
areas of the midbrain in these monkeys under 
anesthesia. In both species, high-speed video 
recordings showed that the vocal membranes 
always vibrated and, in many cases, collided 
during phonation (Fig. 2, H and I, and movies 
S3 to S4). In macaques, as in chimpanzees, the 
vocal folds did not always vibrate (fig. S2, A 
and B, and movie S5A), but the second subject 
did show simultaneous vibrations of both vocal 
membranes and folds, either in phase (fig. $2, 
Cand D, and movie S5B) or phase delayed (fig. 
$2, E and F, and movie S5C). Further ex vivo 
experiments with six macaque larynges repro- 
duced the same patterns (fig. S2, G to L, and 
movie S6). Humanlike vibration of the vocal 
folds alone was not observed in macaques 
ex vivo. These physiological data, combined 
with previous work, provide clear empirical 
evidence that vocal membranes constitute the 
predominant vocal source generator in pri- 
mates, always vibrating and typically collid- 
ing, whereas the contributions of the vocal 
folds are limited or even absent. 

Taken together, the evolutionary loss of the 
vocal membrane transformed the predomi- 
nant source generator from the vocal mem- 
branes to the vocal folds in the human lineage. 
However, understanding the adaptive impor- 
tance of the derived human condition re- 
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Fig. 1. Vocal anatomy and its phylogeny in anthropoids. (A) The cycle of vocal fold vibration in wave 
fashion in humans. (B) Frontal section (gradation inversion, left) and a corresponding line drawing (middle) 
at the level of the dashed line on an excised and formalin-fixed larynx (right) in humans (courtesy of K. Sato). 
(C) Frontal MRI scan (left) and a corresponding line drawing (middle) at the level of the dashed line on vocal 
anatomy reconstructed from MRI scans (right) for a chimpanzee, Pan troglodytes. Blue indicates the 
underlying arytenoid cartilage and green the thyroarytenoid muscle. (D) Variation and phylogeny of the vocal 
membrane and (E to M) MRI-CT frontal scans in the species labeled on (D). Transverse scale bar, 5 mm; 
vertical scale bar, 2.5 mm. Key: ab, body of the arytenoid cartilage; s, sulcus; ta, thyroarytenoid muscle; vef, 
ventricular fold; vf, vocal fold; vm, vocal membrane; and vp, vocal process of the arytenoid cartilage. The 
asterisk (*) indicates a posterior commissure of the vocal membrane and fold. 


quired further investigation of vocal membrane 
function. Although many primate species can 
produce clear harmonically structured calls 
[e.g., marmoset phees (17), macaque coos (18), 
or chimpanzee hoos (79)], even these call types 
often bifurcate to subharmonics or chaos at 
higher intensities (6, 17). Our findings below 
indicate that the primate vocal membrane 
plays an important acoustic role by increasing 
susceptibility to nonlinear phenomena. 
Subharmonics and chaos were observed 
in vivo in both our rhesus macaque (fig. S3, A 
to C, and movie $5, C and D) and squirrel 
monkey data (fig. S3, D and E, and movies S7 


and S8), and also ex vivo in our chimpanzee 
(Fig. 2, F and G, and movie S2C) and rhesus 
macaque data (fig. S2, I and J, and movie 
S6B). The chaotic episodes were associated 
with highly irregular vibrations and collisions 
of the vocal membranes, superimposed upon 
nearly periodic vibrations of the vocal folds 
in chimpanzees ex vivo (Fig. 3, B and C, and 
movies S2D and S3E) and macaques in vivo 
(fig. S3, A to C, and movie S5, C and D). This 
suggests that vocal membrane vibrations play 
a crucial role in generating the high propor- 
tion of subharmonics and chaos empirically 
observed in nonhuman primate vocalization. 
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We used a mathematical model to deter- 
mine the acoustic effects of vocal membrane 
vibration in primates compared with vocal 
fold vibration alone, as typifies humans. In our 
model, the vocal fold is represented by two 
masses coupled with springs (3, 16, 20-22) 
(Fig. 3D). The vocal membrane is modeled as 
an additional reed-like plate that can vibrate 
independently, attached to the upper mass 
(21) (Fig. 3D). We first confirmed that all 
three empirically observed vibration patterns 
can be reproduced by the numerical simula- 
tions in the nonhuman model with vocal 
membranes (fig. S3F). Next, we generated bi- 
furcation diagrams by adducting the vocal 
membranes and folds in the model (Fig. 3, E 
and F). Notably, the phonation threshold pres- 
sure observed at 0.14 kPa in the nonhuman 
model was much lower than that of the hu- 
man model lacking vocal membranes (0.19 kPa). 
This is consistent with previous theoretical 
models of a fixed vocal membrane (J6) and 
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empirical data from marmosets (17), suggest- 
ing that the vocal membrane provides im- 
proved efficiency in phonation and/or allows 
louder and higher-frequency vocalizations in 
nonhuman primates. 

In bifurcation diagrams, we showed that the 
nonhuman model gives rise to various non- 
linear phenomena, including subharmonics 
and chaos, with increasing adduction (Fig. 3E), 
reproducing the chaotic episode observed in 
chimpanzees ex vivo (Fig. 3, B and C, and 
movies S2D and S3E). Similar bifurcations 
were observed with increasing subglottal pres- 
sure in the nonhuman model (fig. S3G). This 
contrasts with the situation in our human 
model, which generated only stable periodic 
vibrations regardless of increasing adduction 
(Fig. 3F) and pressure (fig. S3H). 

The increased instability seen in the non- 
human model is not surprising, because the 
addition of mechanically coupled vocal mem- 
branes contributes additional degrees of 


Fig. 2. Vocal membrane and vocal fold 
vibrations. (A) Images extracted from video 
recordings in vivo in a chimpanzee. From left 
to right: The vocal membranes and folds 

are maximally separated, the glottis is 

closing, and the vocal membranes close the 
glottal space and vibrate. (B to G) Vibration of 
the vocal membrane and vocal fold ex vivo 

in chimpanzees. (B, D, and F) A top-view 
image of the glottis extracted from video 
recordings with the dotted line for 

creating the kymograph (left) and kymograph 
showing a time series of glottal vibration 
(right). (C, E, and G) Electroglottograph 

(upper panel) and acoustic (lower panel) signals. 
(B and C) Only the vocal membranes vibrate, 
whereas the vocal folds do not; and (D and E) vocal 
membrane and fold both vibrate in-phase and 
(F and G) out-of-phase. (H and 1) Images 
extracted from video recordings in vivo in a 
rhesus macaque, Macaca mulatta (H) and 

a common squirrel monkey, Saimiri sciureus (1). 
See Fig. 1 for the anatomical key. 


freedom to laryngeal biomechanics. This dy- 
namically generates multiple routes to voice 
instabilities, as seen in many coupled non- 
linear systems (22-24). Thus, the simulations 
confirmed our empirical observations, indicat- 
ing that the primate vocal membrane, com- 
bined with vocal fold vibration, destabilizes 
the vocal source. Their interaction can spon- 
taneously lead to subharmonics and chaos in 
response to simple linear variations in glottal 
and respiratory parameters. The loss of the 
vocal membrane in humans therefore reduces 
the risk of contaminating the stable vocal fold 
oscillations used in human speech or singing 
with chaotic irregularities and noise. 

In summary, the primate vocal membrane 
interacts with the vocal fold beneath it to ef- 
ficiently generate phonation but simultane- 
ously readily generates nonlinear phenomena 
that destabilize the vocal source. The resulting 
spontaneous phenomena are unlikely to be 
under the animal’s volitional control, but this 
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Fig. 3. Bifurcation observed and numerical simulation of mathematical model. (A) Sound spectrograph 
showing bifurcation to chaos ex vivo in a chimpanzee. (B) Kymographs (left) and electroglottograph 
signals (right) for a stable phonation at 1.2 s of (A); and (C) for the chaotic phonation at 2.7 s of (A). 

(D) Schematic illustration of the nonhuman model with a vocal membrane. (E) Bifurcation diagram created 
by slowly adducting (increasing the contact area) for the nonhuman and (F) the human models. See Fig. 1 


for the anatomical key. 


more complex laryngeal anatomy enables non- 
human primates to efficiently generate a di- 
verse set of complex vocal signals without the 
need for correspondingly complex motor control. 

By contrast, despite its disadvantages for 
efficiency, we suggest that the stable low- 
pitched phonation resulting from the loss of 
the vocal membranes in human evolution en- 
hances the detectability and salience of for- 
mant frequencies, which carry most of the 
phonetic information in human speech. Chaos 
and subharmonics generate an irregular and 
changing spectrum thought to obscure the 
conformation of the vocal tract filter (78, 25) 
by introducing spurious spectral peaks or 
“pseudo-formants” (26). Although nonlinear 
phenomena can occur in humans—e.g., in 
screams and baby cries (27)—they are rarely 
observed during speech except in patholog- 
ical cases (28). We conclude that, in addition 
to changes in vocal tract anatomy and neural 
control, this previously unnoticed simplifi- 
cation of human laryngeal anatomy allowed 
our species to easily generate and regulate 
complex, high-bandwidth vocalizations under 
cognitive control in speech. Thus, seemingly 
paradoxically, a loss of complexity in human 
laryngeal anatomy, coupled with enhanced 
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neural control mechanisms, represented a cru- 
cial evolutionary transition supporting the 
sophistication and complexity of human spo- 
ken language. 
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INSECT MIGRATION 


Individual tracking reveals long-distance flight-path 
control in a nocturnally migrating moth 


Myles H. M. Menz'23*, Martina Scacco’’, Hans-Martin Biirki-Spycher*, Hannah J. Williams??, 
Don R. Reynolds>®, Jason W. Chapman”®?*, Martin Wikelskit??0* 


Each year, trillions of insects make long-range seasonal migrations. These movements are relatively well 
understood at a population level, but how individual insects achieve them remains elusive. Behavioral 
responses to conditions en route are little studied, primarily owing to the challenges of tracking 
individual insects. Using a light aircraft and individual radio tracking, we show that nocturnally migrating 
death’s-head hawkmoths maintain control of their flight trajectories over long distances. The moths 
did not just fly with favorable tailwinds; during a given night, they also adjusted for head and crosswinds 
to precisely hold course. This behavior indicates that the moths use a sophisticated internal compass 
to maintain seasonally beneficial migratory trajectories independent of wind conditions, illuminating how 
insects traverse long distances to take advantage of seasonal resources. 


nsect migration takes place on an enor- 

mous scale, with trillions of individuals 

performing bidirectional seasonal move- 

ments that have important impacts on 

ecosystem function and provision of es- 
sential services (7-5). However, the naviga- 
tional mechanisms and behavioral strategies 
used by night-flying migrants, especially larger 
nocturnal lepidopterans (macromoths), dur- 
ing these long-range journeys have been un- 
known for more than 100 years. 

The view in the first half of the 20th century, 
promoted by C. B. Williams, was that migrant 
moths controlled their movement direction 
irrespective of the wind and maintained 
straight flightpaths over long distances (6, 7). 
Empirical evidence of persistent, self-directed 
tracks was lacking, however, and by the sec- 
ond half of the 20th century, C. G. Johnson 
and L. R. Taylor downplayed the importance 
of orientation behavior and emphasized the 
role of wind in determining migratory trajec- 
tories (8, 9). The modern view has swung back 
again, because radar observations of free- 
flying migrants (10-12) and experimental ma- 
nipulation of tethered individuals (13, 14) have 
both clearly demonstrated that nocturnally 
migrating moths can select adaptive headings 
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and modify them with respect to ambient 
wind conditions. However, owing to the meth- 
odological constraints of tracking such small 
animals over long distances at night (15), 
individual moths have never been tracked 
throughout their migration, and so the capa- 
bility of these migrants to maintain straight 
flight paths, over long distances and in sea- 
sonally beneficial directions, is unknown. 

We used animal-borne radio telemetry to 
record complete tracks of individually tagged 
moths over a full night during autumn mi- 
gration, within the context of the fine-scale 
wind fields experienced as they migrated south- 
ward through the Alps of central Europe. 
Our study species, the death’s-head hawkmoth 
(Acherontia atropos, Sphingidae; Fig. 1A), is 
Europe’s largest lepidopteran, with a rich folk- 
lore that stems from its sinister skull-like 
thoracic markings, unusual habit of raiding 
beehives to steal honey, and startling acoustic 
capabilities (16, 17). A. atropos is a long- 
distance Afro-Palearctic migrant, arriving to 
breed in Europe north of the Alps each spring. 
The subsequent generation returns south the 
following autumn to winter-breeding regions 
in the Mediterranean Basin and likely also 
sub-Saharan Africa (16, 17), covering a distance 
of up to 4000 km. The moths are extremely 
large for flying insects, weighing up to 3.5 g 
[2.65 + 0.15 g (mean + SE), n = 14] and capable 
of carrying tiny very high frequency (VHF) 
radio transmitters. We used a light aircraft 
(Cessna 172) to track hawkmoths fitted with 
transmitters (Fig. 1A) and recorded precise 
(+150 m) GPS locations from the aircraft (78) 
at regular intervals throughout their migra- 
tion (every 5 to 15 min, when possible). 

We recorded nocturnal migratory flights of 
14 moths, with eight at high spatiotemporal 
frequency, as they migrated toward the Med- 
iterranean (Fig. 1, B and C, and table S1). 
Moths initiated migration at a similar time 
after sunset (62 + 4.9 min, range 42 to 81 min, 


nm = 8) and were then followed for a mini- 
mum of 1 hour and up to 3.65 hours (2.5 + 
0.30 hours, 7 = 8; table S1). The moths were 
followed for a mean distance of 62.7 + 6.7 km 
(n = 8) and up to 89.6 km (Fig. 1C and tables 
S1 and S82), the longest distance over which 
any insect has been continuously tracked in 
the field. The overall migration direction was 
toward the south-southwest [Rayleigh test: 
208.70° + 0.42° (mean + SD), mean vector 
length (7) = 0.917, P < 0.001, n = 14; Fig. 1D]. 
This track direction is very similar to the 
preferred headings of a range of migratory 
insects (moths, butterflies, and hoverflies) that 
were observed with radar in Western Europe 
(2, 10, 19, 20), including hawkmoths (10), all of 
which likely follow a similar western route to 
the Mediterranean or northwest Africa. 

We obtained detailed tracks for seven of 
these moths, each with three or more locations 
in a single night (table S2). Moths traveled 
with a mean ground speed of 9.4 + 0.4 m/s 
(33.8 km/hour; 7 = 99 segments; Fig. 1E) anda 
maximum recorded ground speed of 19.4 m/s 
(69.7 km/hour). The mean ground speed re- 
corded (Fig. 1E) is consistent with what we 
expect the upper limit of self-powered flight in 
A. atropos to be (21), suggesting that moths 
modulated their self-powered airspeed and/or 
received relatively modest wind assistance. Al- 
though there was variation in individual mi- 
gration direction, all moths maintained straight 
tracks (straightness index: mean = 0.95, range = 
0.80 to 0.99, 2 = 7; Fig. 1C and table S2) along 
their entire flight paths, which lasted many 
tens of kilometers, despite being subjected 
to winds of varying strength and direction 
throughout their course (Fig. 2). Two of the 
seven moths evidently crossed the Alps during 
a single night, because they were relocated 
south of the Alps during searches early the 
following morning. Their locations were con- 
sistent with their individual trajectories re- 
corded the preceding night, suggesting that 
they had maintained straight tracks even 
while transiting the Alps [covering distances 
of 173.9 and 161.8 km from the release point 
(Fig. 1C and table S1)]. 

To answer the question of how moths are 
able to maintain straight tracks relative to the 
ground while exposed to varying winds, we 
calculated the distribution of the angle of de- 
viation, B (the difference between the track 
and the downwind direction), to determine 
the extent to which the self-powered heading 
influenced the trajectory (22). The analysis 
revealed that moths used three distinct be- 
havioral strategies, which resulted in the flight 
paths of the moths grouping into three direc- 
tional clusters (Fig. 1, C and D). These clusters 
appeared to be partly determined by the am- 
bient wind conditions experienced along the 
flight path (Fig. 2) and partly by the to- 
pography of the landscape (Fig. 1C). 
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Fig. 1. Individual tracking reveals consistently straight flight paths in migrating hawkmoths. (A) Death’s- 
head hawkmoth (A. atropos) showing the characteristic skull-like marking (left) and fitted with a miniaturized 
VHF radio transmitter (weighing 240 mg) (right). (B) Map of Europe with the tracking region outlined [magnified 
view shown in (C)]; the release site was ~50 km north of the Swiss Alps. (C) Nighttime tracks of migrating 
hawkmoths showing persistently straight trajectories throughout a night's flight. Solid lines indicate moths that 
were tracked continuously throughout a night, and dashed lines indicate presumed tracks of moths to their 
elocation position during searches in the following day(s). Colors represent different individuals and are consistent 
between figures. (D) Mean track directions of the 14 moths that demonstrated migratory behavior. Each point 
epresents the track direction of an individual moth. The arrow indicates the overall mean direction (208.70°), and 
arrow length indicates the directedness (r = 0.917). (E) Frequency distribution of track segments in each of 

10 ground-speed categories (each bar represents a range of 2 m/s) for the seven moths that were 
tracked continuously on migration (n = 99 segments). The dashed line indicates the mean of 9.4 m/s. 
Photo credit: Christian Ziegler] 
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The first strategy was used under opposing 
wind directions and resulted in moths taking 
the most direct route to the wintering grounds 
by maintaining a constant southward track 
(Fig. 2, A and B). Under this strategy, the 
moths continuously adjusted their headings 
so that distributions of B had 95% confidence 
intervals (CIs) that overlapped 180° and had a 
mean f close to that value (Fig. 3A and table S2), 
resulting in more-or-less upwind flight (Fig. 4, A 
to D). Examination of ground speeds and wind 
speeds along the track (figs. S1 and $2) (27) 
indicated that moths that used this orientation 
strategy must have flown close to the ground 
(50 m or lower), that is, within their “flight 
boundary layer” [the lower-most layer of the 
atmosphere within which the insect’s self- 
powered flight speed exceeds the wind speed, 
allowing control of their trajectory (4, 22)]. 

The second and third strategies were both 
used under favorable wind directions (i.e., oc- 
casions when southward flight would expose 
moths to some degree of tailwind assistance). 
We predicted that moths using tailwind assist- 
ance would fly in the layer where winds were 
fastest, as previously observed in studies of 
noctuid moths (J0, 19). However, examination 
of ground speeds and airspeeds on these oc- 
casions indicated that hawkmoths that used 
these strategies flew about 300 m above the 
ground, considerably lower than the wind- 
speed maxima available (figs. S1 and S2) 
but high enough to receive some wind assis- 
tance (Fig. 4, E to G). Under these conditions, 
moths appear to balance speed with direction, 
as seen in other migrant moths (JO). The sec- 
ond orientation strategy involved flying rela- 
tively close to the south-westward downwind 
direction (Fig. 2, C and D), but individuals 
modified their heading to achieve a straight 
trajectory lying somewhat further south of 
the strongest wind (as supported by values of 8 
around —30° to —50° and for which the 95% CIs 
do not overlap with 0°; Fig. 3B and table S2). 
The final orientation strategy, which was used 
by a single individual (moth 5), involved flying 
directly downwind (as indicated by the 95% CI 
of 6B overlapping 0°; Fig. 3C and table S82), 
resulting in a track toward the west-southwest 
(Fig. 2D) with a higher ground speed than any 
other moth (Fig. 4H). 

In general, there was a negative relation- 
ship between airspeed and wind assistance, 
with airspeed increasing in headwinds and 
decreasing in tailwinds (Fig. 4). Furthermore, 
median ground speed was relatively similar 
across the orientation strategies (Fig. 4H). 
Thus, moths modulated their ground speed by 
varying their self-powered flight vector under 
different wind conditions to achieve a pre- 
ferred ground speed, similar to that docu- 
mented in many insects (23), which may be 
beneficial in the trade-off between energy 
consumption and travel speed (22). 
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Fig. 2. Migrating hawkmoths continuously compensate for wind to maintain 
straight flight paths. (A to D) Tracks of migrating A. atropos in relation to 
wind direction and speed (length of the arrows). The moths exhibit different 


strategies under different wind conditions, traveling due 


when primarily encountering headwinds [(A) and (B)] but traveling toward the 


The maintenance of consistently straight 
tracks and regulation of ground speed through- 
out the night under variable wind conditions 
strongly suggests that A. atropos has an inter- 
nal compass mechanism. Flight simulator 
studies have demonstrated that migrating 
Bogong moths (Agrotis infusa) use a combi- 
nation of visual landmarks and Earth’s mag- 
netic field to navigate toward a goal (13). This has 
yet to be demonstrated in free-flying migra- 
tory insects, but we predict that migrating 
hawkmoths, which have excellent nocturnal 
vision (24), use a similar suite of sensory mo- 
dalities to navigate over very large spatial 
scales during migration (although nothing is 
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yet known of the capability of hawkmoths to 
detect magnetic fields). At the landscape scale, 
we propose that the moths used topographical 
cues to visually navigate, because magnetic 
cues are unlikely to be accurate enough to 
maintain such straight trajectories. Overlaying 
the straight tracks on a topographical map 
(Fig. 1C) shows that the three orientation strat- 
egies, and their directional clusters, are each 
clearly aligned with a topographical feature that 
would also result in avoiding the highest eleva- 
tions of the Alps (high-altitude passes running 
due south and southwest through the Alps and 
a wide valley running west-southwest that would 
enable circumventing the Alps altogether). 


8.8°E 
Longitude 


southwest and thus skirting the Alps under tailwind conditions [(C) and (D)]. 
Colors represent different individuals and are consistent between figures. Wind 
layers are derived from the COSMO-1 model and represent conditions at 50 m 
above ground level [(A) and (B)] and 300 m above ground level [(C) and (D)], the 
estimated altitudes at which the moths were flying in the corresponding cases. 


Here, we provide evidence that large night- 
flying insects actively select an orientation 
strategy in response to environmental con- 
ditions, at least for some part of their mi- 
gratory journey. To maintain such straight 
trajectories over long periods of time, as seen 
here, the moths must regularly update their 
position relative to whichever navigational 
cues they rely on. However, complete compen- 
sation has not been previously documented in 
a long-range migratory insect and is generally 
an unusual and very rare strategy in long- 
range migrants (25). Our results show that 
complex migratory strategies are not limited 
to vertebrates. 


science.org SCIENCE 


RESEARCH | REPORTS 


Moth1 Moth2 Moth3 Moth11 


390 


> 
{ 
8 
=m © 
8 
gm © 
8 
8 
Do 
8 


t st 
> 1 = 1 
N 2 i =} 
ch Bis 
= 
U 
! Fig. 3. Strategies of migrating hawkmoths in relation to 
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winds encountered along the route was explored by analyzing 
distributions of the angle of deviation [B, the difference 
between the trajectory (T) and the wind direction (W)] for 
each segment of the trajectories of the individual moths 
shown in Fig. 1. (A) In unfavorable conditions such as 
headwinds and variable winds, moths had distributions of B 
with 95% Cls that overlapped 180° in all cases (table S1), 
indicating that they compensated for drift and maintained a 
southward track by selecting a flight heading (H) directly 
upwind. (B and ©) Under favorable conditions (i-e., winds 
blowing toward the southwest), moths showed one of two strategies. As shown in (B), some 
moths had mean values of B around 45° and 95% Cls that did not overlap with 0° (table S1), 
indicating that they partially compensated for drift to migrate closer to south than the 
downwind flow would transport them. As shown in (C), moth 5 had a distribution of B that 
overlapped with 0° (table S1), indicating that it headed more-or-less straight downwind. On 
the circular plots, each point represents the value of £ for all trajectory segments of each 
moth. The arrow indicates the overall mean value of B, and arrow length (r) indicates the 
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Fig. 4. Migrating hawkmoths modulate their airspeed in relation to wind 
assistance. (A to G) Airspeed (m/s) of seven migrating A. atropos individuals [(A) to 
(G)] in relation to wind assistance (m/s) en route. Wind assistance was calculated 
as the wind vector at the location of the moth in the direction of travel toward its next 
location, with positive values indicating tailwind and negative values indicating 
headwind. Moths generally increased their airspeed under headwind conditions [(A) 
to (D)] and reduced their airspeed in more favorable tailwind conditions [(E) to 
(G)]. (H) Ground speed per segment for each of the seven individuals. Boxes 
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represent the 25th and 75th percentiles [interquartile range (IQR)], and whiskers 
indicate data within +1.5 times the IQR. The central bars represent median values. 
Colors represent different individuals and are consistent between figures. Regression 
lines from linear models (LMs) are presented for significant relationships. LMs 
were performed for individuals with more than five data points. Significance 

(P < 0.05) was based on likelihood-ratio tests: (A) moth 1, F = 651.91, P < 0.001; 
(D) moth 11, F = 20.66, P < 0.001; (E) moth 4, F = 0.065, P = 0.807; (F) moth 6, 
F = 24.69, P = 0.008; and (G) moth 5, F = 0.465, P = 0.514. 
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FLUID DYNAMICS 


Dynamics of active liquid interfaces 


Raymond Adkins’, Itamar Kolvin’*+, Zhihong You'}, Sven Witthaus’, 


M. Cristina Marchetti*?*, Zvonimir Dogic’?* 


Controlling interfaces of phase-separating fluid mixtures is key to the creation of diverse 
functional soft materials. Traditionally, this is accomplished with surface-modifying chemical 
agents. Using experiment and theory, we studied how mechanical activity shapes soft interfaces 
that separate an active and a passive fluid. Chaotic flows in the active fluid give rise to giant 
interfacial fluctuations and noninertial propagating active waves. At high activities, stresses 
disrupt interface continuity and drive droplet generation, producing an emulsion-like active 
state composed of finite-sized droplets. When in contact with a solid boundary, active 
interfaces exhibit nonequilibrium wetting transitions, in which the fluid climbs the wall 

against gravity. These results demonstrate the promise of mechanically driven interfaces 


for creating a new class of soft active matter. 


iquid-liquid phase separation (LLPS) is a 

ubiquitous phase transition, with exam- 

ples abounding throughout material sci- 

ence, biology, and everyday life (/, 2). 

Immiscible liquid phases are separated 
by sharp but deformable interfaces that strong- 
ly couple to flows and the input of mechanical 
energy. For example, gentle shaking of an oil- 
water mixture induces gravity-capillary inter- 
facial waves, whereas more vigorous perturba- 
tions break up the entire interface, reinitializing 
the phase separation (3-6). Active matter pro- 
vides an alternative method of continuously 
stirring a fluid (7, 8). In such systems, me- 
chanical energy, inputted locally through the 
motion of microscopic constituents, cascades 
upward to generate large-scale turbulent-like 
dynamics (9-17). We studied how active stresses 
and associated flows perturb soft interfaces and 
LLPS. Using experiment and theory, we iden- 
tified universal features of active-LLPS, includ- 
ing giant interfacial fluctuations, traveling 
interfacial waves, activity-arrested phase sepa- 
ration, and activity-induced wetting transitions. 
These results demonstrate how active matter 
drives liquid interfaces to configurations that 
are not accessible in equilibrium. In turn, active 
interfaces are elastic probes that provide in- 
sight into the forces that drive active fluids— 
for example, by allowing for the measurement 
of the active stresses. 

The active liquid interfaces we studied belong 
to a wider class of activity-driven boundaries 
that includes lipid bilayers, colloidal chiral 
fluids, and interfaces between motile and im- 
motile bacteria in a swarm (72-16). From a 
biology perspective, LLPS has emerged as a 
ubiquitous organizational principle (2, 17). 
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How cytoskeletal active stresses couple to self- 
organization of membraneless organelles re- 
mains an open question. Studies of simplified 
systems can shed light on these phenomena. 
Relatedly, active wetting plays a potential role 
in the development and shaping of tissues (18). 

To explore how activity modifies soft inter- 
faces, we combined poly(ethylene glycol) (PEG) 
and polysaccharide dextran with stabilized 
microtubule filaments and clusters of kinesin 
molecular motors. Above a critical polymer 
concentration, the passive PEG-dextran mix- 
ture phase separated (19). Microtubules and 
kinesin clusters exclusively partitioned into 
the dextran phase, in which depletion forces 
promoted microtubule bundling (Fig. 1, A to 
C). Streptavidin-bound kinesin clusters (KSA) 
stepped along adjacent microtubules within 
a bundle, driving interfilament sliding. The 
kinesin-powered bundle extensions contin- 
uously reconfigured the filamentous network, 
generating large-scale turbulent-like flows, 
similar to those previously studied (Fig. 1D) 
(9). The PEG-dextran interfaces were suscepti- 
ble to large deformations by active stresses gen- 
erated within the dextran phase because of their 
ultralow interfacial tension (<1 uN/m) (19). 

We first visualized the phase separation dy- 
namics of active LLPS in ~30-um-thick hori- 
zontal microscopy chambers. In such samples, 
PEG-dextran interfaces had a nearly flat vertical 
profile (fig. S1). The quasi-two dimensional 
(2D) nature of the system was supported by a 
nearly constant area fraction of the PEG-rich 
domains (fig. S2). In a passive system with 
microtubules but no kinesin motors, the drop- 
lets coalesced slowly (Fig. 1E and movie S1). 
The addition of motors altered the coarsening 
kinetics. At intermediate KSA concentrations, 
active flows powered droplet motility, which 
increased the probability of droplets encoun- 
tering each other and coalescing, thus speed- 
ing up coarsening dynamics (Fig. 1E and 
movie S2). Higher KSA concentrations accel- 
erated buildup of interfacial fluctuations, 
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leading to an entirely different dynamical 
state in which droplets incessantly fused and 
fissioned with each other (Fig. 1, E and G, and 
movie S3). 

To quantify the influence of activity on the 
PEG-dextran phase separation, we measured 
the equal-time two-point correlation function 
C(Ar, ¢) = (I(r + Ar, t)I(r, t)),, where J = 1in 
the dextran phase and -1 otherwise (fig. S3). 
Spatial correlations decayed over a length scale 
&, defined by C(é) = 0.5, which is comparable 
with the average droplet size (fig. S4). For pas- 
sive samples, € increased slowly in time (Fig. 
1F). Enhanced coarsening at intermediate KSA 
concentration was reflected by a much faster 
initial growth of & than the passive case. At 
high motor concentration, € peaked at ~1 hour 
and subsequently decayed to a finite plateau, 
Esteaay- In parallel, average interface curvature 
« monotonically grew, reaching a sufficiently 
large value to cause droplet fission (fig. $5). 
The steady-state length scale Esteaay Was Main- 
tained by the balance of droplet fission and 
fusion events, in which Esteaay WaS Compara- 
ble with the inverse of the average interface 
CUrvatUTe Ksteaay- Concomitantly with the pla- 
teauing of €, active flow speed became constant 
(Fig. IF). These results demonstrate activity- 
suppressed coarsening dynamics, which created 
an emulsion-like state in which finite-sized 
droplets continuously merge, break apart, and 
exchange their content (Fig. 1G and movies S3 
and S4). The volume fractions of the active and 
passive phases were nearly equal (fig. S2). Low 
volume fraction of active fluid generated sim- 
ilar steady states. Finite-sized domains are 
reminiscent of theoretical prediction in motility- 
induced phase separation of isotropic active 
particles (20). However, in contrast to theory, 
the active fluid in our experiments is aniso- 
tropic and perturbs an underlying equilibrium 
phase separation. 

To gain insight into how active stresses drive 
interfacial fluctuations, we formed a macro- 
scopic interface through gravity-induced bulk 
phase separation (Fig. 2A and fig. S6). In equi- 
librium, molecular motion works against the 
density difference Ap and interfacial tension y 
to roughen the liquid-liquid interface. Typical 
disturbances of PEG-dextran interfaces, bereft 
of activity, are ~100 nm in amplitude, resulting 
in boundaries that appear flat when viewed 
with our imaging setup (Fig. 2A and fig. S7). 
When driven out of equilibrium, however, 
interfaces exhibited giant undulations that 
were visible with the naked eye (movie S5). 
As motor concentration increased, interfaces 
became multivalued with frequent overhangs, 
indicating that active stresses directly control 
interface configurations (Fig. 2A and movie S6). 

The interplay of activity and capillarity is 
clarified by measuring the interfacial fluctu- 
ation spectrum. To this end, local interface 
tangent angles 0(s, 2) were sampled at a time 
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tas a function of the arc-length distance s along 
the interface (Fig. 2A). Interfacial fluctuations 
were described by time-averaged power spectra 


S(k) = ((|®x||”),. with 0, = /ds0(s, t)e~™*. Be- 


cause of equipartition of thermal energy among 
Fourier modes, the spectrum of equilibrium 
interfaces is S(k) ~ Tk? /(k? + k2), where T 
denotes temperature. The capillary wave 
numberk, = ,/Apg/ysets a crossover from 
a gravity-dominated regime at large scales 
S(k) ~ k? to a plateau at small scales, where 
surface tension attenuates fluctuations. Active 
interfacial fluctuations were markedly differ- 
ent. Active spectra S(x#) increased for small 
wave numbers (Fig. 2B and figs. S7 and S8). 
After reaching a maximum for k,, ~ 30 mm”, 
it decayed as S(k) ~ k°, instead of plateauing 
as in equilibrium. Whereas the shape of S(x) 
remained the same for all KSA concentrations, 
the root mean square tangent angle 0,.,, in- 
creased linearly with activity (Fig. 2B, inset). 
Using the crossover at k,, as a determinant 
of the fluctuation amplitude, it would take an 
effective temperature of ~10"K to achieve equi- 


Fig. 1. Active LLPS. 
(A) Coexisting 
PEG-rich (dark) and 
dextran-rich (cyan) 
domains. (B) Labeled 
microtubules (red) are 
dissolved in the dextran 
phase. Scale bar, 
75 um. (C) Microscopic- 
scale depiction of phase Bh ‘ 
: a Dextran Phase 
separation. Minority meee 
PEG polymers (gray) in | PEG Phase 
the dextran-rich phase 
induce microtubule 
bundling. (D) Kinesin 
clusters drive interfila- 
ment sliding. (E) Time 
evolution of the active 
LLPS at three KSA 
concentrations. Scale 
bar, 350 um. (F) (Top) 
Correlation length 
evolution E(t) for 
three KSA concentra- 
tions. For 230 nM KSA, 
E plateaus at long 
time (yellow highlight). 
(Bottom) Root mean 
square velocity 
of turbulent flows 
in the dextran 
phase at 230 nM 
KSA. (G) Fusion 
and fission of PEG 


librium interfaces whose roughness is com- 
parable with those measured at the lowest 
activities. 

The dynamics of activity-driven interfacial 
fluctuations exhibited nontrivial spatiotem- 
poral correlations. To gather sufficient statis- 
tics, we imaged ~10-mm-long active interfaces 
over a 2-hour interval. Space-time maps of local 
interface height h(a, t) exhibited diagonally 
streaked crests and troughs that were sug- 
gestive of propagating waves (Fig. 2C). These 
translational modes were also evident in time- 
lapse movies (movie S6). To characterize these 
modes, we measured the dynamic structure 
factor (DSF) of the interface height D(k, o) = 
|dadte"*” +" (h(a', t')h(a’ + 2, t' +t)» » Fig. 
2D). Over a finite range of wave numbers, the 
DSF exhibited peaks at finite frequencies o,, 
confirming the presence of propagating modes 
(Fig. 2E). Increased KSA concentration resulted 
in higher @, for the same wave numbers; thus, 
activity controlled the phase velocity (Fig. 2F). 

The giant nonequilibrium fluctuations and 
propagating wave modes result from the in- 
teraction of active flows in the bulk dextran 


Kinesin 
motor 
cluster 


4 i 

o— ——OnM 
—135 nM 
——230 nM 


Steady-state 


droplets. Sample chamber thickness, 30 um. Scale bar, 100 ym. 
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phase with interfacial elasticity. To elucidate 
the processes that drive active interfaces, we 
numerically integrated 2D hydrodynamic equa- 
tions that describe a bulk phase-separated 
active fluid (2/7, 22). The two coexisting phases 
were modeled as incompressible Newtonian 
fluids under gravity that experience confine- 
ment friction in the low Reynolds number 
limit (23). The top phase was passive, where- 
as the velocity of the bottom phase v was 
governed by 


yVv —1V’°v = -VP+V-o (1) 


where P is the pressure, 1 is the viscosity, and 
yy is the confinement friction. The stresses o 
driving the flows were assumed to be gen- 
erated by an active liquid crystal producing 
extensile active stresses, o* = oQ, with a < 0. 
The local orientational order was quantified 
by atraceless tensor Qy = (nity — 54/2) aver- 
aged over molecular orientations n. Active 
shear flows engendered orientational order 
through flow-induced alignment. These as- 
sumptions are summarized in the continuum 
equation 

DQ | 


af 
py TQ = mt M2) 


where @ is the vorticity tensor, u is the strain 
rate, and / is the flow alignment parameter. H 
denotes elastic forces that arise from the liq- 
uid crystal free energy, and yg is the rota- 
tional viscosity of microtubule bundles (23). In 
the absence of activity, the liquid crystal is 
in the isotropic phase, which is consistent with 
the microtubule density used in experiments. 

The hydrodynamic model reproduces key 
experimental observations. Finite-sized chaotic 
flows, driven by active stresses, continuously 
deform the liquid-liquid interface (Fig. 3A 
and movie S7). Similarly to experiments, the 
interfacial power spectra showed a crossover 
between growth at small wave numbers and 
decay at large wave numbers, whereas the root 
mean square tangent angle increased linearly 
with |o| (Fig. 3B). The numerically obtained 
DSF also exhibited signatures of active travel- 
ing waves (Fig. 3, C to E), as in the experiment. 
The wave frequencies m,(%) increased with 
activity, showing active-stress-dependent wave 
velocity. Our numerical model also suggests 
a noninertial mechanism of active waves 
(12), which differs from conventional inertia- 
dominated capillary waves (24, 25). In the 
context of our hydrodynamic model, the active 
waves arise from the coupling between the 
interface vertical displacement / and orienta- 
tional order Q in the interfacial region (22). 
Stress balance at the interface predicts that the 
orientational order drives interfacial deforma- 
tion as 0;h ~ v* ~ a|k|Q, where v* is the active 
contribution to the flow velocity normal to the 
interface. In turn, passive flows v? relaxing the 
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Fig. 2. Active propagating waves. (A) Conformations of bulk phase-separated interfaces with increasing 
motor (KSA) concentration. 6(s) defines the local interface tangent angle as a function of arc length. 

h(x) is the local interface height. Chamber thickness, 60 um. Scale bar, 150 ym. (B) @ power spectra obtained 
by averaging over ~3-mm interface length, 6 to 8 hours after sample preparation. (Inset) Root mean 
square @ versus KSA concentration. (C) Space-time map of h. Disturbances propagating along the 
interface (dashed lines). Interfaces were corrected for drift and tilt. (D) The square-root DSF averaged 

4 to 6 hours after sample preparation. Maximum lag distance, 1000 um; maximum lag time 1800 sec. 

(E) DSF sections at constant wave number (colored dots). Black lines are best approximations with 


F(a) =a [(o/ar)? sf | 


-1 2 = 
+ bf [(/a0)° = y + (oaw/af)*} _ where a, b, @1, Wo, and Aw are 


adjustable parameters. Data were taken over 2 to 4 hours after sample preparation. (Inset) Frequency peaks 


@p = \/@pAw*/2 overlayed on DSF intensity (full line). DSF maxima for constant 


(dashed line). For (C) to (E), KSA concentration, 270 nM. (F) Peak frequencies of the propagating modes ap 
(solid symbols). DSF maxima at constant frequency sections (open symbols). 


interface at a wave number dependent rate v(k) 
feed back to induce local liquid crystalline 
order 0,Q ~ Aikv? ~ —Aikv(k)h. Consequent- 
ly, interface height obeys a wave equation 
O2h~ — ahik|k|v(k)h (23). Accordingly, travel- 
ing wave velocities increase with active stress, 
which is in agreement with both experiments 
and numerics. 


Propagating waves might be a generic fea- 
ture of active boundaries (72, 13). More broadly, 
the active-stress-dependent wave dispersion 
mirrors those of elastic waves in entangled 
polymers solutions (26). Although the numer- 
ical hydrodynamics reproduced qualitative 
features of the experimentally observed active 
fluctuations and waves, there were important 
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Fig. 3. Numerical hydrodynamics of active interfaces. (A) Phase A 
boundary (cyan) separating a passive fluid (black) and an active 

isotropic liquid crystal. The latter are depicted with local order 

parameter (grayscale) and orientation (red lines). Legends denote B 
activities |a|. Scale bar, 100 um. (B) 8 power spectra. Simulation 
interval, 2 mm. Correlation maximum lag distance, 400 um. 
(Inset) Root mean square @ versus |a|. (©) Square root DSF. 


interval, 2 mm. (D) 


DSF intensity (solid 


Activities are noted in absolute values. 


Fig. 4. Active wetting transition. (A) Active fluid wetting a 
polyacrylamide-coated vertical boundary at three KSA 
agnified section 

of the wetting layer at 345 nM KSA showing local orientations 
of microtubule bundles. Scale bar, 20 um. (€) Average 
center-of-mass height of active fluid within 5¢, of the vertical 
boundary, where ¢, = 45 um. Zero height is defined as 

the average bulk interface position. Each point is the mean 

of two experiments. Uncertainty in sample preparation is 
indicated with horizontal bars. (Inset) Average wetting height 
profiles as a function of distance from the vertical boundary. 
80 nM (blue) to 270 nM (red). 

Black line indicates ~e~*/“e. (D) Wetting profiles from 
numerical simulations. Equilibrium contact angle is 6, = 10°. 
Interface position is in cyan. Scale bar, 50 um. (Inset) Local 
liquid crystalline orientation (red lines). (E) Center-of-mass 
height of the numerical wetting profile. Red line indicates 
prediction of Eq. 3 with 4, = 2.5 um. Onset of complete wetting 
is indicated with the blue arrow. (Inset) Average numerical 
wetting profiles. Activities are 10 mPa (blue) to 40 mPa (red). 


concentrations. Scale bar, 50 um. (B) 


KSA concentrations, 


quantitative differences. In particular, with 
increasing activity, numerics predicted an 
increase in both interface fluctuations and 
bulk velocity. By contrast, in experiments ac- 
tive flows remained constant between 200 and 
350 nM KSA, whereas interfacial fluctuations 
increased (fig. S9). Moreover, in numerics, the 
maximum wave number #,, increased with 
activity while remaining constant in experi- 
ments (Figs. 2B and 3B and fig. S10). 

To demonstrate the particular properties of 
active interfaces, we studied their structure 
next to a solid boundary (Fig. 4, A and B). In 
the absence of motors, the interface assumed 
an exponential profile h o e*/"e with a decay 
length of @, ~ 45 um, which we identified with 
the capillary length £. = \/y/Apg. At the wall, 
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a.| = 40 mPa. Maximum lag time, 270 s. Maximum 


ag distance, 670 um. Total simulation time, 3 hours. Simulation 
Constant wave number sections of DSF 
intensity (solid circles). Best approximations F(@) are as in 

Fig. 2E (black lines). |a| = 20 mPa. (Inset) wp(k) overlay on 

line) and DSF maxima for constant w 

(dashed line). (E) Peak frequencies of the propagating modes 

®p (solid symbols) and DSF maxima for constant @ (open symbols). 
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the rise in the interface height was ~’70 um, 
which is close to the maximum capillary rise 
of //2¢,, indicating complete wetting (23). At 
intermediate KSA concentrations, the capil- 
lary rise exhibited active fluctuations around 
the equilibrium exponential height profile, 
and the time-averaged center-of-mass height 
of the wetting region increased slowly (Fig. 
4C). Above a critical value of 300 nM KSA, 
activity generated a new interfacial struc- 
ture. Specifically, we observed formation of a 
~20-um-thick dynamical wetting layer, which 
climbed several hundred micrometers above 
the equilibrium capillary rise (Fig. 4A and 
movie S8). Within this layer, microtubule 
bundles preferentially aligned with the wall 
(Fig. 4B). Coinciding with the appearance of 


the microtubule-rich wetting layer, the mean 
capillary rise sharply increased (Fig. 4C). These 
observations demonstrate an activity-driven 
wetting transition beyond the complete wetting 
of a passive fluid. 

We performed numerical simulations of the 
active-interface adjacent to a vertical boundary 
(23). The liquid crystal director was anchored 
parallel to the wall, and the surface-liquid en- 
ergy Y,. corresponded to an equilibrium contact 
angle 10° (Fig. 4D). Similar to experiments, 
the average height profile had an exponential 
decay (Fig. 4E, inset). As the activity o increased 
from zero, the height of the contact point in- 
creased. Furthermore, above |a| = 10 mPa, the 
active fluid generated a thin wall-adjacent 


layer, indicating a transition from partial to 
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complete wetting (Fig. 4D and movie S9). The 
capillary rise was supported by a ~3-um-thick 
£,,, vertically aligned liquid crystalline domain, 
with Q ~ 1. This domain generated coherent 
active stress along the wall o* = a, which sup- 
ported the interface rise. Balancing the active 
tension y, =|o*|4, at the contact point with 
wall adhesion y,,, surface tension ycos@,, and 
gravity F, resulted in a boundary condition for 
the climbing interface (fig. S11) (23). 


|o" | + Yw = yeos®, + Fy (3) 


Predictions of the center-of-mass height of the 
capillary rise, using Eq. 3, show a crossover 
from slow to fast growth at a = 10 mPa, which 
is in agreement with the partial-to-complete 
wetting transition seen in simulations (Fig. 4E 
and fig. S12). 

Active interfaces provide a distinct exper- 
imental probe with which to estimate the 
magnitude of the active stress, a critical pa- 
rameter that governs dynamics of active flu- 
ids. To avoid resorting to various assumptions 
on the numerical model, we analytically 
solved Eq. 1, treating the stress o as a random 
field with correlations (oj(r, t)o;(0,0)) = 
C2 nse t/a ltl/%, where correlation length ¢, 
and time t, are identified with those of the 
bulk active flow (fig. S13) (23). The analytical 
model captured the interface fluctuations 
spectrum S(x) without fitting parameters, 
revealing that its nonmonotonic shape resulted 
from active flows with scale-dependent kinetic 
energy spectrum (Fig. 5A, fig. S10, and eq. S32) 
(23, 27). By contrast, fluctuations of equilibrium 
interfaces are driven by thermal energy ~p7, 
where kx is the Boltzmann constant, that is 
equally distributed among all scales (23). By 
integrating S(k) over all wave numbers, the 
active stress is predicted to increase propor- 
tionally to tangent angle fluctuations o,,; = 
P0ms, With p =~ 9 mPa/rad (Fig. 5B and eq. $35). 

To independently verify these numbers, the 
force balance Eq. 3, which is associated with 
active wetting, provides an alternative method 
of estimating active stress. For intermediate 
KSA values, before the appearance of the ac- 
tive wetting layer, the active stress estimated 
from wetting is comparable with those ex- 
tracted from interface fluctuations (Fig. 5B). 
Above 300 nM KSA, active stresses are a few 
times larger than those obtained from inter- 
face fluctuations. These large values might be 
a consequence of flow-enhanced alignment of 
microtubule bundles within the thin wetting 
layer. The formation of the active wetting layer 
at finite activity, however, is outside the scope 
of the static stress balance embodied in Eq. 3. 
A more complete description of the wetting 
transition would include dynamical consid- 
erations, such as active wave propagation and 
gravitational sedimentation, that appear to be 
essential for the formation and turnover of the 
wetting layer. 
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Fig. 5. Active stress estimation from interface 
fluctuations and wetting. (A) Normalized tangent 
angle power spectrum. KSA concentration, 200 nM 
(circles). Analytical hydrodynamic prediction is 
indicated with the dashed line (eq. S35). (B) Root 
mean square active stress estimates from interfacial 
fluctuations G;ms = POrms (circles). Experimental 0,5 
are as in Fig. 2B, inset. Analytical hydrodynamic 
theory (Eq. 1 and eq. S35) predicts p = 8.8 mPa/rad, 
by using active length scale é, = 65 um and time 
scale t, = 77 s that were derived from bulk flow 
correlations (fig. S13). Active stress estimates from 
wetting o? = 4(hom — h9,,) Apg (Squares). Mom is the 
wetting center-of-mass height (Fig. 4C). ho, equals 
Mem for 180 nM KSA. 


We demonstrated a rich interplay between 
active fluids and soft deformable interfaces. 
Liquid interfaces provide a quantitative probe 
that can reveal intrinsic properties of the ac- 
tive fluids, such as active stress. As well, bulk 
active fluids drive the extreme interfacial de- 
formations that yield intriguing nonequili- 
brium dynamics, including arrested phase 
separation, stress-dependent noninertial prop- 
agating waves, and activity-controlled wetting 
transitions. Our findings provide a promising 
experimental platform by which to design 
shape-changing adaptable soft materials and 
machines whose capabilities begin to match 
those observed in biology (28-30). 
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EVOLUTION 


The evolutionary network of whiptail lizards reveals 
predictable outcomes of hybridization 


Anthony J. Barley'7t*, Adrian Nieto-Montes de Oca, Norma L. Manriquez-Moran‘, Robert C. Thomson” 


Hybridization between diverging lineages is associated with the generation and loss of species diversity, 
introgression, adaptation, and changes in reproductive mode, but it is unknown when and why it 
results in these divergent outcomes. We estimate a comprehensive evolutionary network for the largest 
group of unisexual vertebrates and use it to understand the evolutionary outcomes of hybridization. 
Our results show that rates of introgression between species decrease with time since divergence and 
suggest that species must attain a threshold of evolutionary divergence before hybridization results 

in transitions to unisexuality. Rates of hybridization also predict genome-wide patterns of genetic 
diversity in whiptail lizards. These results distinguish among models for hybridization that have not 
previously been tested and suggest that the evolutionary outcomes can be predictable. 


nderstanding why mechanisms of spe- 

cies diversification change across the 

tree of life remains one of the most 

challenging endeavors in biology. Re- 

cent studies have demonstrated how 
the strength and pace of mutation, genetic 
drift, and selection can generate distinct pat- 
terns of diversification (J-3). Gene flow through 
hybridization is arguably the most poorly 
understood of these primary mechanisms of 
evolution, despite an increasing appreciation 
for its importance during speciation (4). This 
is driven, in part, by the fact that hybridization 
induces a network structure in evolutionary 
history that is vastly less tractable to infer 
than bifurcating phylogenetic history. There- 
fore, our understanding of speciation in many 
groups where hybridization is common re- 
mains incomplete. 

The diversity of hybridization outcomes in 
nature is substantial. Hybridization is asso- 
ciated with numerous evolutionary pheno- 
mena, including genetic introgression, hybrid 
speciation, lineage fusion, changes in repro- 
ductive mode, and polyploidization (5). These 
distinct biological outcomes have different ef- 
fects on diversification. For example, hybrid 
speciation and lineage fusion generate or eli- 
minate species diversity, respectively. Tran- 
sitions from sexual to unisexual reproduction 
are also disproportionately associated with hy- 
bridization (6). These phenomena have moti- 
vated theories to explain the relationship 
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between hybridization and the evolution of 
unisexuality and the effects of hybridization 
on genetic diversity and microevolution (7, 8). 
However, our understanding of the manner 
and extent to which the diverse outcomes of 
hybridization are predictable remains ham- 
pered by a lack of empirical data for testing 
these theoretical models. This is in part caused 
by a limited number of biological systems 
that contain replicated examples of each 
outcome. In this work, we used the North 
American whiptail lizard (Aspidoscelis) to 
test whether hybridization produces pre- 
dictable outcomes. 

Whiptails include the highest diversity of 
unisexual lineages among vertebrates. The 
clade includes >30 sexual species and ~15 
recognized unisexual taxa that reproduce by 
parthenogenesis. Diploid unisexual lineages 
in this clade are derived from hybridization 
events between sexual species that form F1 
hybrid offspring that reproduce clonally (9). 
Triploid unisexual lineages are thought to 
be derived from backcrossing events between 
diploid unisexual lineages and sexual species. 
Even before unisexual reproduction was docu- 
mented in whiptails, the clade was recognized 
as being “an ideal object lesson in Nature’s 
way of species-making” because of the notable 
patterns of phenotypic variation they exhibit 
across populations (10). However, these pat- 
terns of variation and hybridization also pro- 
duced myriad complications in defining the 
boundaries between species that have hin- 
dered understanding of the group’s systematics 
and diversification history despite decades 
of study (9-11). In this work, we study the 
tree-like and network-like patterns of evolution 
in Aspidoscelis to understand the mecha- 
nisms that underlie their divergent outcomes 
of hybridization. 

Using a reduced-representation genomic 
dataset (tables S1 and S2 and data S1), we in- 
ferred patterns of evolutionary history among 
all the sexual whiptail species (72). Introgres- 


sion tests based on the genome-wide frequen- 
cies of site patterns detected widespread 
evidence of nontree-like patterns of evolution 
(fig. S1 and data S2 and S3). A substantial 
proportion (~60 to 70%) of the 33 sexual spe- 
cies appears to have been involved in intro- 
gressive hybridization events (figs. S2 and S3). 
Multiple significant tests were detected for 
several species, which suggests that they have 
genomes containing introgressed regions from 
multiple species (data S2 and S3). Phylogenetic 
network analyses found that patterns of intro- 
gressive hybridization among sexual whiptails 
could largely be captured by ~5 to 13 reticu- 
lation edges (figs. S4 and S5). The results 
across all hybridization analyses were largely 
congruent. We used admixture graphs to com- 
pare introgression hypotheses in the few cases 
where they were not (figs. S6 and $7 and tables 
S3 and S4). We detected no evidence of in- 
trogressive hybridization between the most 
distantly related species, even when they occur 
in sympatry (figs. S2, S4, and S5). Introgressive 
hybridization appears to have been common 
even when phenotypic evolution is rapid across 
populations, as it is in this clade, where ~'70% 
of species encompass multiple, morphologi- 
cally distinctive subspecies (table S1). Some 
whiptail subspecies may reflect instances of 
lineage fusion, where populations that have 
not diverged sufficiently to develop repro- 
ductive isolation come back into contact (13). 
Several of the inferred hybridization events 
include species that are morphologically, eco- 
logically, and evolutionarily divergent (figs. 
S2 and S8). We lack time estimates for the 
introgression events but suspect that they 
span a broad range of historical time frames, 
given the diversity of the introgression pat- 
terns (Table 1). 

We used phylogenetic network approaches 
to infer which of the sexual species are the 
parental ancestors of the hybrid, diploid uni- 
sexual lineages [fig. S9; see (12) for a discus- 
sion of terminology]. We used population 
genetics tools to infer the ancestry of the tri- 
ploid unisexual lineages (fig. S10 and table S5) 
(12). We identified 11 unisexual lineages that 
have distinct combinations of genetic ancestry 
derived from 10 sexual species (table S6). Six 
of these unisexual lineages are diploid, and 
two of these appear to have formed multiple 
times through hybridization. The other five 
are triploid, and two of these also appear to 
have been formed multiple times. Within uni- 
sexual whiptails, a primary hybrid speciation 
process always appears to precede speciation 
by ploidy elevation through genome addition 
(Fig. 1). 

Our ensemble analysis provides a compre- 
hensive view of the patterns and outcomes of 
hybridization in whiptails, including both in- 
trogressive hybridization and hybrid specia- 
tion (Table 1 and Fig. 2) (12). The phenomenon 
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Fig. 1. Summaries of hybrid speciation in unisexual whiptail lizards. Primary hybridization (i.e., a hybridization 
event between two diploid sexual species that is associated with transition to unisexual reproduction) precedes 
genome addition (i.e., ploidy elevation) (left). Sexual species associated with each process are indicated. Colored 
bars in STRUCTURE plot (each representing data for a single lizard) show estimates of the genome-wide proportion 
of genetic ancestry derived from sexual parental species for each unisexual individual. Violin plots show 95% 
confidence intervals for formation time estimates of diploid unisexual lineages. kya, thousand years ago. Photo is 
of Aspidoscelis deppii, a sexual species that is both an ancestor of the parthenogenetic lineage Aspidoscelis 
cozumelus and has been involved in introgressive hybridization with Aspidoscelis guttatus (photo credit: A.J.B.). 


Table 1. Summary of introgressive hybridization events and their support from alternative analyses. 
Posterior probability (PP) for one species in a pair as under reticulation edge in Bayesian analysis, with 
alternative species as an ancestor. AL indicates improvement in likelihood score when focal hybrid node is 
present in the network. Support for focal hybridization event by f branch analysis is indicated by the 
corresponding Z score. The HyDe column indicates the largest Z score calculation for a test that includes 
both focal species. Support in best admixture graph model is indicated by an X. Dashes indicate analysis is 
not applicable (PhyloNet) or introgressive hybridization was not supported by analysis (others). 


Hybridizing species Admixture PhyloNet PhyloNetworks f branch HyDe 
graph (PP) AL 

tigris tigris—tigris aethiops = 1.0 = 10.1 = 

d 


scalaris—costatus occidentalis X 1.0 - 9.6 40.9 


*Phylogenetic discordance between mitochondrial DNA and nuclear DNA datasets in Barley et al. (13) also supports these 
introgressive hybridization events. 
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of isolation and lineage divergence followed 
by secondary contact has occurred repeatedly 
across broad time scales in Aspidoscelis. Con- 
sequently, whiptails exhibit many replicated 
instances of both introgressive hybridization 
and hybrid speciation, which occurs through 
transitions in reproductive mode or ploidy 
elevation. This provides an opportunity to test 
competing hypotheses explaining the observed 
association between hybridization and the 
evolution of unisexuality in vertebrates. Three 
main hypotheses have been proposed. The 
phylogenetic constraint hypothesis proposes 
that particular sexual lineages are inherently 
predisposed to produce parthenogenetically 
competent individuals upon hybridization (6). 
The balance hypothesis predicts that as diver- 
gence among hybridizing sexual species in- 
creases, the probability that their hybrid 
offspring will produce unreduced oocytes also 
increases, allowing for the evolution of uni- 
sexual reproduction (14). This is balanced 
against the increasing likelihood that hybrids 
will experience impairments in fecundity and/ 
or viability arising from genetic incompatibil- 
ities if divergence between their parents is too 
high. Therefore, this hypothesis predicts that 
hybridizing lineages that are at intermediate 
levels of evolutionary divergence produce uni- 
sexual offspring, whereas those that are at too 
low or too high levels lead to sexual offspring 
or a failed cross, respectively. A further refine- 
ment on both of these hypotheses has been 
called the rare formation hypothesis, in which 
complex genetic preconditions are necessary to 
initiate unisexual reproduction in hybrids (15). 

These mechanisms are not mutually exclu- 
sive, but we see limited evidence for the phylo- 
genetic constraint hypothesis in whiptails. Of 
the six specific primary hybridization events 
associated with transitions to parthenogenetic 
reproduction, only two of the 10 sexual paren- 
tal species have been involved in more than 
one of these events (Fig. 2 and table S6). Fur- 
ther, if we model unisexual parental status 
as a discrete trait on the whiptail phylogeny, 
we see virtually no evidence of phylogenetic 
signal (Pagel’s 1. = 6.6 x 10°; Fritz and Purvis’s 
D = 0.68). By contrast, we see conspicuous 
support for the balance hypothesis. At low 
levels of divergence, hybridization among sex- 
ual whiptail species results only in introgres- 
sion (Fig. 3A). Only those hybridization events 
spanning deep levels of evolutionary diver- 
gence result in the production of unisexual 
lineages, and we did not detect evidence of 
introgression between the parental species of 
the parthenogenetic lineages. Divergence time 
estimates between the pairs of sexual parental 
species that have generated parthenogenetic 
lineages span a broad range (~15 to 25 million 
years; fig. S11 and table S7), but these hybrid- 
ization events have all occurred in the recent 
past, which suggests that these lineages are 
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Fig. 2. Evolutionary network history of the whiptails. Red reticulation edges represent introgressive 
hybridization events between sexual species. Blue reticulation edges denote the evolution of diploid 
parthenogenetic lineages through the primary hybrid process (Fig. 1). Purple reticulation edges denote the 
evolution of triploid parthenogenetic lineages through the genome addition process. These edges illustrate 
instances of hybridization between diploid parthenogenetic lineages and sexual species. Parthenogenetic taxa 
are indicated in bold. Some of these, indicated by asterisks, have formed multiple times through independent 
hybridization events between the same sexual species (table S6) (photo credit: A.J.B. and A.N.-M.d.0.). 


evolutionarily ephemeral owing to constraints 
imposed by a lack of genetic recombination 
(Fig. 1). Finally, we see some evidence in sup- 
port of the rare formation hypothesis. A lim- 
ited number of extant unisexual lineages are 
found in nature compared with the theoretical 
number of sexual parental combinations whose 
divergence time falls within this range and 
whose geographic distributions overlap (6 of 
28 combinations). This suggests that specific 
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genotypic combinations may also be required 
to induce changes in reproductive mode. This 
is consistent with laboratory crossing experi- 
ments that have had only rare success in pro- 
ducing parthenogenetic lineages (J6, 17). 

Our results suggest that time since diver- 
gence plays a major role in determining the 
outcomes of hybridization in whiptails, with 
outcomes switching from introgressive hybrid- 
ization to hybrid speciation when the diver- 


gence time between species exceeds ~10 million 
years (Fig. 2, fig. S11, and table S7). There is a 
gap in evolutionary relatedness between pairs 
of whiptails involved in these two types of 
hybridization outcomes [between a genetic 
distance of ~0.0097 and 0.0126 based on the 
restriction site-associated DNA (RAD)-tag 
data]. Therefore, it remains an open question 
whether, within that divergence range, hybrid- 
ization between pairs of whiptails could con- 
ceivably result in either outcome. Predictions 
of the balance hypothesis are similar to those 
made by theoretical models of homoploid 
hybrid speciation that are not accompanied 
by a transition in reproductive mode (18). This 
may reflect the importance of particular types 
of reproductive incompatibilities in both pro- 
cesses that become more likely to arise at 
intermediate levels of divergence. Incompati- 
bilities in meiosis genes could be necessary to 
cause transitions from sexual to partheno- 
genetic reproduction in hybrid individuals 
(79). In homoploid hybrid speciation, incom- 
patibilities that lead to assortative mating 
and discrimination against parental populations 
may be important for initiating reproductive 
isolation in hybrid populations. 

Although patterns regarding the evolution 
of parthenogenesis and mechanisms of poly- 
ploidization appear to be largely conserved in 
Aspidoscelis (Fig. 1), patterns of introgression 
between species exhibit substantial variation 
(fig. S2). In their foundational comparative 
study of speciation, Coyne and Orr (20) have 
demonstrated that the amount of reproduc- 
tive isolation between species of Drosophila 
is correlated with their time since divergence. 
If generalizable, this result predicts that in 
nature, the amount of introgression between 
species should be negatively correlated with 
their time since divergence (2/, 22). We ex- 
pect this relationship because, as species di- 
verge, they should accumulate an increasing 
number of incompatibilities that both de- 
crease the probability that hybrids can serve as 
a bridge for gene exchange and increase the 
probability that introgressed regions will be 
purged by negative selection over time. Con- 
sistent with this prediction, we see a strong 
negative correlation between the admixture 
proportion (i.e., the shared proportion of the 
genome) and the amount of evolutionary di- 
vergence that separates two species that have 
hybridized (Fig. 3B). Within species, allopatric 
whiptail subspecies show high variability in 
admixture proportions, with some values sim- 
ilar to those estimated from sister taxa and 
some much larger (table S8). These results 
are consistent with a polygenic threshold spe- 
ciation model, which has garnered support 
from recent empirical studies (23, 24). Under 
this model, once two lineages reach a thresh- 
old of divergence, Dobzhansky-Muller incom- 
patibilities are typically in sufficient number 
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the F4 ratio) is negatively correlated with their time since divergence (here quantified —_ linear regression with 95% confidence interval (B). ****P < 0.0001; ***P < 0.001; 


by the patristic distance that separates them). (C) Genome-wide measures of 


to begin suppressing introgression. This effect 
may determine whether sufficient reproduc- 
tive isolation has evolved to prevent lineage 
fusion upon secondary contact. Thus, multi- 
ple thresholds may be important to speciation 
and hybridization outcomes in whiptails at 
different time scales. 

Models for speciation with gene flow make 
contrasting predictions about the effects of 
hybridization on diversification, including the 
formation of genomic islands of speciation 
within the genome, contributions to adapta- 
tion and the evolution of new phenotypes, and 
increasing rates of diversification (3, 25, 26). 
To examine the effects that hybridization has 
had on genomic diversity during speciation in 
whiptails, we compared estimates of observed 
heterozygosity, nucleotide diversity, and gene 
diversity between lineages that have been 
formed by different speciation processes (Fig. 
3C and table S9). Parthenogenetic lineages 
have maintained high levels of observed he- 
terozygosity because they are clones of F1 hy- 
brid offspring. Sexual species with a history 
of introgressive hybridization frequently have 
higher estimates of nucleotide and gene diver- 
sity than those that do not. These same values in 
some sexual species even exceed those calculated 
for the parthenogenetic lineages, which likely 
reflects these species having had multiple hybrid 
partners during their evolutionary history. 

Causes and consequences of hybridization 
in whiptails extend beyond the genome and 
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**P < 0.01; *P < 0.05; ns, n 


mode of reproduction. For example, one of the 
earliest hypotheses to explain the geographic 
distribution of parthenogenetic lineages pro- 
posed that climatic fluctuations during the 
Pleistocene may have contributed to shifting 
distributions of species, providing opportuni- 
ties for secondary contact and hybridization in 
doing so (14, 27). This idea is consistent with 
results of an ancestral state biogeographic re- 
construction for whiptails, which suggests that 
many of the parents of parthenogenetic line- 
ages recently dispersed into the areas that they 
now occupy and where these lineages formed 
(fig. S12). Hybridization may also play a role in 
mediating phenotypic evolution. In Aspidoscelis, 
gigantic body size has evolved at least three 
different times (fig. S13). These giants are 
associated with some of the strongest (i.e., 
large D statistics indicating hybridization 
with multiple species) and most notable (i.e., 
with evolutionarily divergent species) signals 
of hybridization in the evolutionary history of 
the clade (table S10). 

Further insights could be gained through 
comparative studies of other unisexual verte- 
brates. In Darevskia lizards, the specificity of 
the sexual species involved in the formation 
of the multiple parthenogenetic lineages and 
the apparent lack of correlation between gene 
flow and divergence have been interpreted as 
support for the phylogenetic constraint hypo- 
thesis (28). The evolutionary divergence sep- 


arating parents of the unisexuals in these 


ot significant. 


two clades appears to be comparable (table 
S11). Cobitis fishes reproduce by gynogene- 
sis, and crossing data in this group are also 
consistent with the balance hypothesis (29). 
The rare formation hypothesis has been a 
favored explanation in the gynogenetic Amazon 
molly (Poecilia formosa) (30). Age estimates 
for the unisexual whiptails based on the 
RAD data are much younger than those for 
P. formosa (median formation times ~160,000 
to 250,000 versus ~500,000 generations ago). 
Resolving the mechanisms that underlie these 
similarities and differences will help advance 
our understanding of some of the longest- 
standing areas of inquiry in biology. 
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Cavity-mediated electron-photon pairs 


Armin Feist'?+, Guanhao Huang*“+, Germaine Arend}, Yujia Yang**}, Jan-Wilke Henke*”, 
Arslan Sajid Raja*“, F. Jasmin Kappert”?, Rui Ning Wang**, Hugo Lourengo-Martins'”, Zheru Qiu>“, 
Junqiu Liu**, Ofer Kfir'?, Tobias J. Kippenberg***, Claus Ropers?2* 


Quantum information, communication, and sensing rely on the generation and control of quantum 
correlations in complementary degrees of freedom. Free electrons coupled to photonics promise novel 
hybrid quantum technologies, although single-particle correlations and entanglement have yet to be 
shown. In this work, we demonstrate the preparation of electron-photon pair states using the phase- 
matched interaction of free electrons with the evanescent vacuum field of a photonic chip—based optical 
microresonator. Spontaneous inelastic scattering produces intracavity photons coincident with energy- 
shifted electrons, which we employ for noise-suppressed optical mode imaging. This parametric pair- 
state preparation will underpin the future development of free-electron quantum optics, providing a 
route to quantum-enhanced imaging, electron-photon entanglement, and heralded single-electron and 


Fock-state photon sources. 


ptical parametric processes generate 

quantum correlations of photons with- 

out changing the state of the optical 

medium involved. Entangled photons 

from parametric down-conversion (7) 
are an essential resource for heralded single- 
photon sources, quantum communication, and 
quantum teleportation (2). Spatially separated 
entanglement afforded by such “twin beam” 
pairs was extended to various physical contexts, 
with examples in atomic beams (3), electron 
spins (4), and photon-phonon correlations in 
micromechanical systems (5). Free-electron 
beams are an emerging target for quantum 
manipulation and sensing, promising quantum- 
enhanced imaging (6, 7), spectroscopy (8-10), 
and excitation (11-14). A variety of technologies 
that bridge electron microscopy and photonics 
(15) are being established to join the most 
powerful probes in atomic-scale imaging and 
spectroscopy, respectively. Among these, stim- 
ulated near-field scattering offers mode-specific 
probing of optical properties, whereas sponta- 
neous electron energy loss and cathodolumi- 
nescence access electronic transitions and the 
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total photonic density of states (15). Structural 
design has been shown to allow for a tailoring 
of the spectral and spatial properties of electron- 
driven radiation (/6, 17). Harnessing quantum 
optics approaches, measurements of photon 
statistics are used to reveal single quantum 
emitters (78) or photon bunching (/9, 20). 
Theoretical work predicted single-photon 
cathodoluminescence into a waveguide (21), 
and recent experiments studied the electron- 
induced excitation of whispering gallery modes 
(22, 23) and optical fibers (24). However, im- 
peded by a lack of mode-specific and suffi- 
ciently strong coupling, correlations between 
single electrons and well-defined photonic 
states have remained elusive. In this study, 
we use spontaneous inelastic scattering via 
the evanescent field of a high-quality factor 
(Q) photonic chip-based optical microresonator 
to generate free-electron cavity-photon pair 
states. We characterize the dual-particle herald- 
ing efficiencies and demonstrate coincidence 
imaging of the cavity mode with a substantial 
background suppression. 

The interaction of electron beams with cav- 
ities and resonant structures represents a uni- 
versal scheme for generating electromagnetic 
radiation. In the quantum optical description, 
the inelastic scattering can be modeled as a 
coupling of free electrons to optical vacuum 
fields (8). Scattering with the evanescent field 
of the optical microresonator, an electron at 
energy E, generates intracavity photons at 
frequencies w in an energy-conserving manner 
(Fig. 1A), described by the scattering matrix 
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S= exp(gqud' —h.c.). Here, a’ is the cre- 
ation operator of the optical mode, gq, is the 
vacuum coupling strength (25, 26), and h.c. 
is the Hermitian conjugate. The ladder oper- 
ator 6 reduces the energy of an electron by that 
of one photon. The interaction induces entan- 
glement between the electron energy and the 
cavity population and results in the state 


|Wes Vpn) = deal ~ nh) |\n) (1) 


(We, electron state; y,, photonic state; ?, reduced 
Planck’s constant) with the coefficients c, = 


2 n 
exp (— leet) et corresponding to Poissonian 
scattering probabilities (25). For a weak vacuum 
coupling strength |qu|<1, the state is dominated 


by the zero- and one-photon contributions 


Wes pn) 1E)|0) + Sul — ho)[1) + O(82,) 
(2) 


where © is order of approximation. Our mea- 
surements are designed to probe this state by 
detecting single photons in coincidence with 
inelastically scattered electrons by the energy 
of one photon. In the experiment, the continuous 
electron beam of a transmission electron micro- 
scope [120-keV beam energy, 25-nm focal diam- 
eter; see (27) section S1.3] traverses a photonic 
chip-based microresonator (Fig. 1C) and inter- 
acts with the vacuum fields to populate empty 
cavity modes. 

The Si,N, microresonator (28) is designed 
for low optical loss, efficient fiber coupling, and 
free-space near-field access [for optical charac- 
teristics, see (27) $1.2]. Enhancing the inter- 
action by electron-light phase matching (29-32), 
we engineered the resonator cross section 
(2.1 um by 650 nm) (26) for maximum photon 
generation probability per electron, which is 
predicted up to 10% around 0.8-eV photon 
energy (a wavelength of 1.55 um). 

Cavity photons generated in the clockwise 
propagating mode (Fig. 1A) are coupled out 
to a bus waveguide and are further guided by 
optical fibers to a single-photon avalanche diode 
(SPAD). The energy and arrival time of each 
electron is measured with an event-based de- 
tector behind a magnetic prism spectrometer 
(Fig. 1B), allowing for electron-photon coin- 
cidence experiments. 

First, we spatially and spectrally map the 
electron-induced cavity excitation by placing 
the electron beam in proximity of the resona- 
tor (Fig. 2, A and B). Measured with an optical 
spectrometer, the overall spectral range of de- 
tected radiation spans from 1520 to 1620 nm, 
primarily limited by the bandwidth of electron- 
light phase matching and consistent with nu- 
merical simulations [Fig. 2C; see (27) $2.2]. 
The spectrum exhibits a comb-like structure 
(Fig. 2D) as a result of free-electron coupling 
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to the microresonator modes a, [1, mode index; 
see (27) $2.1 for a multimode description of the 
interaction]. The 1.58-nm spacing of the emis- 
sion peaks matches the optically characterized 
quasi-transverse magnetic mode free-spectral 
range and confirms the predominant photon 
extraction from this mode family [see (27) S1.2]. 

Spatial characterization of the intracavity 
excitation is obtained by raster scanning of the 
electron beam (Fig. 2, A and B) and detection 
of emitted light with a SPAD. Figure 2E dis- 
plays the obtained single-mode (top) and spec- 
trally integrated (bottom) photon rates, which 
decay exponentially with distance from the 
structure, tracing the near-field mode profile 
in this spectral range. The strongest photon 
generation is observed for electrons passing 
the ring resonator tangentially, as expected 
for phase-matched electron-light interaction. 
Figure 2F shows the photon generation rates 
along the chip surface. The single-mode count 
rate shows a pronounced oscillation in the ra- 
dial distance to the center of the ring resonator. 
This is caused by the interference between se- 
quential interactions (Fig. 2A) of an electron 
with the cavity vacuum field (33), proving the 
conservation of mutual coherence in the scat- 
tering process. Similar Ramsey-type interfer- 
ence is also observed for an externally pumped 
ring resonator (26). Integrating the full mode 
spectrum of the cathodoluminescence aver- 
ages out the oscillatory pattern, in agreement 
with numerical simulations (Fig. 2G). The total 
scattering probability is modeled as the sum 


P=)°> |u|” of the individual mode con- 


tributions Zquy, [see (27) S1.1 for details]. 

The spontaneous creation of photons by sin- 
gle electrons satisfies energy-momentum con- 
servation, lowering the energy of an electron by 
hw and transferring the corresponding mo- 
mentum to the cavity photon (27). We probe 
this correlated electron-photon pair state by 
coincidence measurements of both particles 
(Fig. 3). The electron beam is held fixed in the 
near field of the cavity at a distance of ~160 nm 
from the surface. At this position, we detect 
photons with a probability of 4.6 x 10° per 
electron passing the structure. Considering 
coupling and detection losses [see (27) S1.4], 
this corresponds to an intrinsic generation 
probability of ~2.5%. 

The arrival time and kinetic energy of each 
electron is measured by event-based spec- 
troscopy, using the stream of photon events 
recorded by the SPAD for time tagging [Fig. 1A 
and (27) 81.5 for details]. Figure 3A shows 
the energy- and time-dependent histogram of 
electron arrivals relative to the photon event 
closest in time. The two main features ob- 
served are a time-independent background 
of accidental coincidences around zero energy 
loss and an anticipated sharp correlation peak 
around 0.8-eV energy loss and O-ns time delay. 
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Fig. 1. Coincidence detection of electron-photon 
pairs generated at a photonic chip. (A) High- 
energy electrons traverse a fiber-coupled Si3N4 
microring resonator, generating intracavity photons. 
Single photons and inelastically scattered 

electrons are detected in coincidence. B, magnetic 
field. (B) Electron spectrum after interaction. 

(C) Optical microscope image of the photonic chip 
with a sketch of the passing electron beam. 

r, microresonator radius. 


The spectrum of the correlated electrons (Fig. 3B, 
red) is downshifted by the energy of one photon 
with respect to the zero-loss peak (ZLP) (black). 
As a result of the narrow electron-light phase- 
matching bandwidth of ~50 meV, these spectral 
distributions are very similar in broadening 
and shape [~0.5-eV width; see (27) S1.6 for 
details]. 

We next insert a mechanical slit in the elec- 
tron spectrometer (Fig. 3C) to reject electrons 
that were not scattered, allowing for higher total 
electron flux, and thus photon rates, without 
detector saturation. The background-corrected 
time profile (inset) shows the precise temporal 
structure of the correlated electron-photon 
pairs with a width of 3.91 ns. In principle, the 
temporal correlation trace should reproduce 
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Fig. 2. Spatial and spectral mapping of intracavity photon generation. (A and B) Schematic of the measurement geometry. The electron beam (green) passes 
parallel to the chip surface (gray) and interacts twice (red dots) with the ring resonator (dark red). (C) Optical emission spectrum (blue; electron beam passes 

resonator tangentially) and simulated spectral envelope (gray). (D) Close-up view of the spectral mode comb and a selected single cavity mode (red). (E) Raster- 
scanned photon-generation maps [saturation corrected; see (27) S1.4] for the selected mode and the full spectrum [intensity patterns illustrated in (B) with red and 
blue]. (F) Signals from (E) integrated along the direction of the white arrow. (G) Simulation of the position-dependent electron scattering probability (50-nm distance). 


the cavity decay time, but it is not resolved in 
the present experiments [current device: ~0.5-ns 
lifetime for Q ~ 5.5 x 10°; see (27) S1.2]. 

By selecting loss-scattered electrons within 
a specific time window, we identify correlated 
events on a single-particle basis. The measure- 
ment of the electron energy for each event, in 
contrast to conventional optical spontaneous 
parametric down-conversion, presents a direct 
measure of the energy quanta 7 deposited in 
the optical cavity. The single-electron energy 
detection at the first energy loss sideband 
therefore represents an optical state projection 
from the entangled state (Eq. 1) onto the (non- 
classical) single-photon intracavity Fock state 
(13, 21, 25). This general principle, illustrated in 
the schematic of Fig. 3E, enables interparticle 
heralding schemes for either electrons or 
photons. Notably, interactions with multiple 
electrons—relevant for studying electron-electron 
correlations—can be excluded owing to the 
multihit capability of the detector. 

Such heralded particle sources are quantified 
in terms of the measured rates of electrons (R,), 
photons (R,), and correlated events (Rp.) and 
the Klyshko heralding efficiencies ni. = Rye /R; 
(i, 7 = e, p, 7 #7), which describe the conditional 
probability of experimentally detecting a 
heralded particle [see (27) S1.6]. For the data 
shown in Fig. 3C, we measure nk ~ 0.11% and 
Nk ~ 57% for photons and electrons, respec- 
tively. The less-efficient heralding of photons 
follows from considerably higher losses in 
their output coupling and detection [see (27) 
$1.4]. Not being a fundamental physical limi- 
tation, we expect appreciable improvements 
on the photon collection efficiency with tech- 
nical optimizations, including use of super- 
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Fig. 3. Time- and energy-correlated electron-photon pairs. (A) Two-dimensional correlation histogram 
over the electron energy and relative photon timing (30-s integration, 0.65-pA electron current on detector) 
showing a strong coincidence peak. (B) Spectral distributions of the uncorrelated and correlated electrons 
(background subtracted). (C and D) Close-up view of correlation histogram (ZLP blocked, 60-s integration, 
46-pA beam current at sample) with coincidence time trace (white) and fraction of true coincidences. FWHM, 
full width at half maximum. (E) Illustration of cavity-mediated inelastic electron-photon scattering and 


coincidence measurement. 


conducting detectors and strongly overcoupled 
resonators. Taking into account particle losses 
in transmission and detection, we estimate 
intrinsic heralding efficiencies n} of ~50%. In 
principle, near-unity intrinsic heralding effi- 
ciencies can be achieved (27) [see detailed 
estimate of nj in (27) $2.3]. 


Employing both particle channels facilitates 
correlation-enhanced measurements, isolating 
physical scattering events from uncorrelated 
noise such as detector dark counts. As a figure 
of merit for noise suppression, the fraction of 
true coincidences Rye /(Rpe — Race), With the 
uncorrelated (accidental) background rate Race, 
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Fig. 4. Correlation-enhanced cavity mode imaging. Spatial distributions of (A) photons [cathodolumines- 
cence (CL)], (B) loss-scattered electrons [electron energy-loss spectroscopy (EELS)], and (C) correlated 
events (30-ms integration per pixel, starting ~270 nm above the chip surface; see Fig. 2 for measurement 
geometry). The positions of the resonator and the mode envelope (solid and dashed lines, respectively, serve 
as a guide to the eye) are indicated. (Inset) Sketch of the cavity field decay into the vacuum (scattering 
probability |g|2). The energy windows applied to (B) and (C) are optimized individually [see (27) S1.6]. 

(D) Logarithmic plot comparing the distance-dependent exponential mode decay from (A) to (C) (integrated 
in the area above the resonator, effectively 1-s acquisition per point). The lowest noise and highest dynamic 
range (DR) (more than three orders of magnitude) are observed for the time-correlated imaging. 


is defined for selecting a specific time-delay 
and energy-loss window (Fig. 3D), reaching 
98.6%—i.e., a coincidence-to-accidental ratio 
(CAR) of ~75 [see (27) S1.6]. This demonstrates 
the high-fidelity generation of correlated electron- 
photon pairs, promising quantum-enhanced 
imaging. 

Here we provide a proof-of-concept demon- 
stration of coincidence-gated raster mapping 
of the resonator mode. Specifically, Fig. 4, A 
and B, shows the time-integrated electron and 
photon signals, respectively, and Fig. 4C dis- 
plays the correlated events only. To quantify 
the correlation-induced improvement in image 
contrast, Fig. 4D compares the respective 
count rates for the individual and correlated 
signals on a logarithmic scale. Both the photon 
and electron signals trace the exponential de- 
cay of the evanescent field away from the struc- 
ture, leveling off at constant values for larger 
distances. These background offsets limit the 
dynamic range of the mode imaging and are 
determined by the uncorrelated noise rates— 
i.e., the photodetector dark counts (130 counts/s) 
and residual overlap of the ZLP with the 
energetic gate (1.5 x 10* counts/s), respectively. 
Rejecting the uncorrelated background, the 
correlated signal (Fig. 4D, yellow curve) re- 
solves the evanescent decay of the cavity field 
over longer distances. Despite a slightly smaller 
signal (ng < 1), we improve the dynamic range 
by two orders of magnitude [see (27) $2.5]. 

From a broader perspective, harnessing cor- 
relations of electrons with radiative emission 
(34) shows promise for enhancing contrast and 
resolution, as recently shown in the study of 
core-level (35) and valence electronic excita- 
tions in nanostructured materials (36). Using 
postselection in the electronic and photonic 
degrees of freedom (9, 13), such schemes can 
be generalized to trace state-specific scattering 
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cross sections and create heralded pair states 
as a function of linear or angular momentum, 
polarization or spin, or frequency or energy. 

Our findings demonstrate and apply the 
parametric generation of electron-photon 
pair states. Besides showing the capability for 
contrast enhancement in correlation-gated 
imaging, we implement flexible on-chip electron- 
heralded photon and photon-heralded single- 
electron sources. The integrated photonics 
platform allows for flexible electron-light phase- 
matching and spectral tuning of the generated 
cavity state. Postselection of multiloss electron 
events will facilitate a unique approach to 
generate tailored and higher-order photon 
Fock states. By merging free-electron quantum 
optics with integrated photonics, we provide a 
pathway toward a new class of hybrid quantum 
technology that relies on entangled electrons 
and photons, with applications ranging from 
photonic quantum state synthesis to quaantum- 
enhanced sensing and imaging. 
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Finding a new calling 


hen I arrived at the mountaintop observatory, I was bursting with excitement. I was about to 
finish my junior year studying astrophysics, with plans to apply for a Ph.D. in the field, and I 
had the rare opportunity to go on an observing run. As I arrived at the tiny cabin that would 
be my home for the next few days, I finally felt like an astronomer. Unfortunately, the excite- 
ment didn’t last. As I sat alone, staring at a computer screen instead of the night sky, pressing 
a combination of buttons over and over to control the telescope and retrieve data, I had to 


admit: This wasn’t what I had imagined. 


I had wanted to be an astronomer 
ever since my father gave me the DVD 
box set of Cosmos: A Personal Voy- 
age when I was a teenager. Watching 
Carl Sagan talk about the heavens, I 
thought I had found my calling. But 
after my experience at the observa- 
tory, I had to face a new truth. If this 
was the astronomer’s life, then the 
astronomer’s life wasn’t for me. 

I returned to college on the verge 
of panic. I still wanted to pursue a 
Ph.D. to learn more about the natu- 
ral world and conduct independent 
research one day. But where was my 
future, if not in astronomy? 

I thought I might be better off 
doing research that was more inter- 
disciplinary and collaborative. I re- 
called a quantum physics class I had 
taken and thought that field might 
fit the bill. My academic adviser was 
skeptical. I can’t blame him; I was throwing away a ton of 
original astronomical data to start from scratch in a new 
field, and I only had 2 weeks left in the school year to de- 
sign a new thesis project. But I embraced the challenge. 
As I read and thought about my potential new field, my 
brain buzzed with excitement—and anxiety about what my 
adviser would say. I put together a proposal I was eager to 
pursue, and to my joy and relief, my adviser approved it. 

In some ways, my new project mirrored the observatory 
experience, as I sat for extended periods in a darkened lab, 
aligning lasers and shining them through a pair of slits. But 
being able to shape and control quantum phenomena in- 
spired me in a way astronomy did not. I also recruited a 
friend to work with me, which made the research less lonely. 
As we talked through the project, explaining complex topics 
to each other and reflecting on the big picture of what we 
wanted to achieve, a revelation dawned: I had been drawn 
to Cosmos not because of the subject, but because of the fi- 
nesse with which Sagan explained mind-blowing concepts. 


“New possibilities can be far 
more rewarding than doggedly 
Sticking to a years-old dream.” 


That was the spark of inspiration I 
wanted to follow. 

I began to try out my communica- 
tion skills, working with high school 
physics students and tutoring in the 
college physics room. I got a real 
charge from these experiences. Even 
though they meant I had to work later 
in the lab to get my research done, I 
felt I had finally found my niche. I de- 
cided to pursue a Ph.D. in quantum— 
a field where massive breakthroughs 
are often barely visible to the wider 
world. This was my chance to be both 
a scientist and a communicator. Dur- 
ing my Ph.D., I volunteered at local el- 
ementary schools and participated in 
other communication and outreach 
efforts. Sometimes it was hard to 
justify the amount of time they took 
away from my thesis work, but speak- 
ing to excited students gave me the 
extra bump of motivation I needed to return to my research. 

As I was completing my Ph.D., I knew I wanted to find 
work that let me both contribute to science and draw more 
people into the field. I interviewed for a wide range of jobs, 
including consulting work, industry positions, and even 
professorships at liberal arts colleges. It felt like serendip- 
ity when IBM reached out about two potential jobs, one 
focused on community outreach and the other on quantum 
engineering. I asked whether I could do both. Now, in addi- 
tion to conducting research, I lecture at universities, run a 
summer school, and work with educators. 

It’s not what I imagined, sitting on my living room floor 
as a teenager. But as I’ve learned more about science and 
myself, I’ve realized that remaining open to new possibili- 
ties can be far more rewarding than doggedly sticking to a 
years-old dream. 


Olivia Lanes is a researcher and team lead for education at IBM. 
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Notice is hereby provided that a meeting of the members of the American Association 
for the Advancement of Science (the “Association’’) will convene on September 16, 
2022, to consider proposed amendments to the Association’s Charter that would 
collectively permit the Association’s Board of Directors to adopt, without further 
action, future amendments to the Association’s Constitution and Bylaws and permit 
meetings of the Association’s members to occur anywhere in the United States. The 
full text of the proposed amendments can be found below. The amendments were 
endorsed by the Board of Directors and approved by the Council on July 22, 2022. 


The meeting will take place September 16, 2022, 11 a.m. EDT, at the Boston Marriott 
Newton Hotel (2345 Commonwealth Avenue, Newton, MA 02466). Members wishing 
to attend the meeting virtually will also have the opportunity to do so. Please email 
GovernanceModernization@aaas.org for virtual attendance instructions. 


Votes will be accepted in-person or by proxy vote. Details of the proxy vote will be 
emailed to all members in good standing on or about September 2, 2022. Members 
who are not physically in attendance, including those attending the meeting virtually, 
must cast their vote via this proxy before the meeting. 


Additionally, at the next annual election of Association officers, members will also be 
asked to consider amendments to the Association’s Constitution and Bylaws that would 
permit the Association’s Board of Directors to amend both such governing documents 
without further action. Notice of such election will be provided at a future date. 


Proposed Amendments to the Association’s Charter 


The Charter of American Association for the Advancement of Science, as amended, 
is hereby further amended by inserting the following after Section 3A: 


Section 3B. The directors may make, amend or repeal the by-laws of said 
corporation in whole or in part. 


Section 3C. Meetings of the members may be held anywhere in the United States. 
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The Michelson Philanthropies and Science Prize for _ —— 


Immunology focuses on transformative research in 
human immunology, with trans-disease applications 
to accelerate vaccine and immunotherapeutic 


discovery. This international prize supports investigators 


35 and younger, who apply their expertise to research 
that has a lasting impact on vaccine development and 
immunotherapy. It is open to researchers from a wide 


“The Michelson Philanthropies & Science 
Prize for Immunology will greatly impact 
my future work. As | am just starting my 
scientific career, it will illuminate my work, 
spark interest and support me to continue 
my research in this field.” 


Paul Bastard, MD, PhD, 


range of disciplines including computer science, 
artificial intelligence/machine learning, protein 
engineering, nanotechnology, genomics, parasitology 
and tropical medicine, neurodegenerative diseases, 
and gene editing. 


Laboratory of Human Genetics 
of Infectious Diseases, Imagine 
Institute (INSERM, University of 
Paris), Paris, France; and The 
Rockefeller University, New York. 


Dr. Bastard received the inaugural 
Grand Prize for his essay: “Why 
do people die from COVID-19: 
Autoantibodies neutralizing type 

| interferons increase with age.” 


Application deadline: Oct. 1, 2022. 


For more information visit: 
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